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Preface 



Logic has been heralded as the basis for the next generation of computer sys- 
tems. While logic and formal methods are indeed gaining grounds in many 
areas of computer science and artificial intelligence the expected revolution 
and breakthrough has not happened as yet. Notwithstanding the object ori- 
ented paradigm programming as well as processor design is still done in an 
imperative way, which has far-reaching consequences for the quality of soft- 
ware and engineering products. 

A logical approach instead would offer many advantages such as machine- 
checked correctness, quick adaptability to design changes, dramatic reduction 
of maintenance costs, understandability of design, a far-reaching potential for 
the automation of the synthesis of the product from the design constraints, 
and so forth. Why then does not everyone follow the logical approach? 

In the eighties is was beginning to dawn on the logic community that for 
most applications logic, as used then, might lack a vital ingredient which, 
on the other hand, is inherent in imperative languages and which no one 
would want to miss. What logic lacks is a simple and natural way to describe 
actions and change without facing inherent problems. In AI these problems 
center around what is called the frame problem. Without a solution to the 
frame problem — and its cousins — logic would continue to suffer from this 
shortcoming. Fortunately, the frame problem finally has been solved in such 
a way that the drawback is disappearing. 

By now there is a number of formal variants of the solution to the frame 
problem. One consists in describing actions and change within the fluent 
calculus, a first-order Prolog-like formalism. The fluent calculus forms the 
basis of the contents of this book. It thus sets out from a basis which has 
overcome the drawback which held logic back for many years. 

As mentioned there were the cousins of the frame problem yet to be tack- 
led for a completely satisfactory solution. In particular, it is the so-called 
ramification and the qualification problem which belong to this family. Pro- 
fessor Thielscher in this book offers a convincing solution to these two ac- 
companying challenges. I am deeply convinced that, as a consequence, there 
will be a renewed and strong interest in the logical approaches once these 
solutions will have become more widely known. I therefore wish this book 
the success it deserves for several reasons. 
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One of these reasons is the attractive mix of illustrative examples and 
formal precision, which makes this book easily accessible to a wide-spread 
readership. Another reason consists in the deep insight provided by the book 
into a fascinating topic which is central to our human thinking and has been a 
(mostly philosophical) issue for at least two thousand years. But the main rea- 
son is the level reached by the combined solutions to the frame problem and 
its cousins. Now programming in logic may comfortably include commands 
which call for logically defined changes without compromising in logical rig- 
orousness. Similarly, engineering design, which always involves change, may 
now naturally be formalized in a logical setting with all the attractive ad- 
vantages mentioned above. This includes the logical specification of agents in 
networks or autonomous robots which exchange information with each other 
as well as with human users on a most comfortable linguistic level. 

It usually takes a number of years until fundamental insights diffuse 
through the community to a degree that the potential consequences material- 
ize. May this book speed-up this diffusion process by finding many interested 
readers who spread out the news about another dawn of logic in computer 
science and artificial intelligence. 



Wolfgang Bibel 




Prologue 



When John woke up he felt an uneasiness as so often these days. An instant 
later, however, it came to him that there was no reason for worrying anymore. 
The project he and his colleagues at the lab for months had sacrificed nearly 
everything for had finally come to a successful end the day before. So John 
relaxed, closed his eyes, and let his mind wander over the whole course of 
events again. 

At the beginning there was this robot which was capable of performing 
rudimentary tasks such as moving around obstacles, grasping and handling 
objects, even climbing stairs (though it looked a bit clumsy to the attentive 
observer). Yet the robot was completely lacking the ability to solve tasks be- 
yond these primitive ones on his own, that is, without John and his colleagues 
devising and telling him a minute plan of how to combine elementary actions 
in order to get the job done. A project was therefore established aiming at 
providing the robot both with insight into his own capabilities and with the 
ability to build a cognitive model of his environment. This, John argued, 
would enable the robot to do planning all by himself by means of reasoning 
when he has a certain goal in mind, that is, by drawing the right conclusions 
from what he knows as to the effects of his actions and from what his sensors 
tells him about his surroundings. A catchy name always being the basis for 
success of a project, they finally agreed on the acronym Elaser, meaning 
Effective Logically Acting and Sensing Robot. 

It was obvious from the beginning that when explaining Elaser the ef- 
fects of his actions it was impracticable to enumerate all conceivable situations 
in which an action can be performed and to state the result of its execution 
separately for each such situation. Actions had rather to be described by some 
sort of laws which specify the effects in general terms. In this way Elaser 
learnt, for instance, that grasping an object and carrying it from A to B al- 
ways causes the object to be at location B and no longer at location A. In this 
context it turned out vital to provide the robot with a piece of information of 
universal nature. Namely, whenever he was told an action had such-and-such 
effects, he should assume this description be complete and so to conclude 
that moving around printed matter, for instance, does not change its con- 
tents. In order to test Elaser whether he had really grasped this crucial 
point, John asked him one morning to get a copy of the free local newspaper. 
The robot obediently walked out of the lab, spotted the right paper among 
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different ones lying around in the front yard, correctly concluded that taking 
the newspaper into the building would not alter its being the local one, and 
so delivered it to John, who considered this an undeniable success. 

Was it intuition or mere coincidence that John wanted to double-check 
the next morning? The night had brought some rain and it was still drizzling 
when Elaser left the building. Anxiously watching the robot out of his win- 
dow, John saw that all newspapers were wrapped in transparent protective 
covers. To his great astonishment, Elaser ripped open the cover of one of 
the packages, tossed it away, and came back to the lab with a copy of the 
local newspaper soaked through. Criticizing the robot for his behavior, John 
and his colleagues learnt that he had no other choice: After all, Elaser ex- 
plained to them, had he picked up and delivered the cover, the newspaper 
would still be lying in the front yard. For the single effect of carrying around 
an object like the cover is that only this very object changes its location, or 
so they had told Elaser. 

Back to the drawing board. Apparently, the description of what happens 
if objects are moved needs to be split into two cases. Either there is no second 
item inside, or else there is, in which case both change their location. But what 
if a third object were placed inside the second one? This seems to require just 
another rule, which, however, still does not cover the case of four interlocking 
items, and so on (the alarming picture of an infinite Matryoshka, a nest of 
innumerable wooden puppets, entered John’s mind). Pacing restlessly up and 
down his office racking his brain over this problem, John’s eye fell on a book 
that had just been mailed to him with a note attached saying that it might 
be useful for their project. Could it be that a solution to their problem can 
be found in there, John thought, and so he opened the book and began to 
read. At least the introductory chapter seemed promising to him. The author 
first presented a basic theory of actions. He showed how to formally describe 
actions, including non-deterministic ones like rolling a dice, by specifying their 
general effects and applicability conditions. Furthermore, it was illustrated 
how to exploit this knowledge when reasoning about specific situations. The 
whole theory revolved around the paradigm that each action specification 
concentrates on what the action potentially affects, so that non-effects are 
to be inferred rather than being part of the description. The author called 
this adequacy of action specifications. So far, so good, John thought, but 
what if it is overly strict to suppose that nothing outside an action law is 
affected? Soon after starting off reading the second chapter, John realized 
that the latter was in fact entirely devoted to this question. There may be 
more to the impact of actions on the environment, so the author argued, 
than what is specified in action laws, which refer to the direct effects only. 
Actions may, however, have additional, indirect effects, which derive from 
general dependencies, or constraints , among the various properties that are 
used to describe the state of the environment. If John understood correctly, 
then this means, for instance, that an indirect effect of carrying around the 
cover is that the newspaper is being relocated, too. This additional effect is 
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triggered by the general fact that two objects being stuck together can never 
be at different places. Accounting for indirect effects of actions, so the book 
continued, requires to meet two main challenges. First, the assumption needs 
to be suitably weakened which says that actions affect nothing but what 
is mentioned in action laws. Second, the aforementioned constraints often 
suggest, from the mere formal perspective, indirect effects which would never 
occur in reality and, hence, need to be sorted out. This was illustrated by an 
instructive example where toggling a light switch is concluded to have the 
magical side effect that another switch jumps its position rather than that 
light turns on, which one should have expected. John learnt that these two 
aspects together are commonly referred to as the “Ramification Problem.” 

With growing enthusiasm John kept on reading in hope of encountering 
a solution. As a matter of fact he found more than one. Unfortunately, all of 
them seemed perfectly reasonable to him — but only up to the point where 
the author proved their limited applicability. The author did so by discussing 
several scenarios for which the respective ‘solution’ either missed an obvious 
indirect effect (just like Elaser did in concluding that he had better rip open 
the cover of the newspaper, John thought), or proposed rather funny effects, 
which could never occur in reality. Finally, being faced with all these failures, 
the author introduced the concept of relations each of which directly links a 
single cause with a single effect. These causal relationships form the basis for 
the generation of indirect effects: Whenever a cause is brought about, then 
the additional occurrence of the effect is reckoned with. In this way all and 
also no ‘phantom’ indirect effects are obtained, so the author argued, pro- 
vided, of course, the formal causal relationships both soundly and completely 
reflect causality in reality. Well, thought John, that is convincing, so we just 
have to provide Elaser with the knowledge that a change of an object’s 
location causes any object inside to change its location as well. However, he 
was not looking forward to telling his colleagues that they have to draw up 
by hand and feed Elaser all necessary causal relationships. Fortunately the 
author showed how these relationships can be automatically extracted from 
much more general knowledge. The chapter concluded with an axiomatiza- 
tion in formal logic of the whole theory of causal relationships as means to 
solve the Ramification Problem. To John’s satisfaction, he noticed that this 
axiomatization was based on the same principle they had used for design- 
ing Elaser — a principle which was called “Fluent Calculus” in the book. 
So they seem to have made a lucky decision. John remembered how difficult 
it was in the beginning to convince his colleagues of the advantages of this 
Fluent Calculus over the other’s two favorites, the so-called Situation Calcu- 
lus and Event Calculus, respectively, when it comes to inferring non-effects 
of actions. The author of the book shared his conviction, and so John and 
his colleagues just had to put the proposed axiomatization on top of the one 
already existing in Elaser. 

Having redesigned Elaser following the instructions, the project mem- 
bers sent out the robot to fetch a newspaper each and every morning, and 
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the robot did his duty worthily. He always came back with the right paper, 
plain on sunny days and safely wrapped in a protective cover whenever it 
had been raining. Until this one day which John will never forget. As usual, 
Elaser had left the lab sometime in the morning. Eventually, however, John 
and his colleagues realized that his return was long overdue and still there 
was no sign of him coming back. Anxiously recalling the disastrous morn- 
ing when Elaser delivered the soaked newspaper, John followed the robot’s 
path to the front yard. Standing next to a package with the local newspa- 
per inside, Elaser was totally paralyzed; even when John enquired of him 
what had happened there was no reaction at all. John’s last hope was that 
investigating the package lying nearby would shed some light upon the mat- 
ter. Indeed he made a surprising discovery. Some rascal, who presumably 
had watched Elaser picking up a newspaper every morning, had teased the 
robot by introducing a brick into the package, which thus was too heavy 
for poor Elaser. Still, however, this did not account for the total blackout. 
When they had managed to run the robot again, he explained to John and his 
colleagues that he knew the only precondition of picking up an object is that 
it must be reachable. Now that was clearly the case when Elaser tried to 
lift the package, so the formal specification implied, with unerring logic, that 
success of this action is guaranteed. Nonetheless the expected effect failed to 
materialize, which entailed a logical contradiction so that Elaser’s whole 
conception of the world broke down instantaneously. By the next morning, 
they had taught the robot that a second precondition for being able to lift an 
object is that it is sufficiently light. But then they watched Elaser anxiously 
ripping open the protective cover around the newspaper again, this time in 
order to make sure that there be no brick or any other heavy item inside. 
To John that seemed rather ridiculous. After all, it is highly unlikely that a 
newspaper package cannot be fetched on account of its weighing too much. 

Back to the drawing board again. The problem was to find a suitable way 
of specifying action preconditions which need not be verified each time prior 
to assuming that the action in question be executable. John turned to the 
book which had already served him so well. Indeed the third chapter, entitled 
“The Qualification Problem,” was devoted to exactly their new question. The 
author started off with arguing that in real-world environments most actions 
have many more preconditions than one is usually aware of. The reason for 
this unawareness is that most of these conditions are so likely to be satis- 
fied that they are assumed away unless there is evidence to the contrary. 
Conditions of this sort are called “abnormal,” and if their presence prevents 
the successful performance of an action, then the latter is considered “abnor- 
mally disqualified.” Any particular situation, then, is reasonably presupposed 
to being ‘normal’ as long as this does not conflict with what is known or has 
been observed. John vaguely remembered this as a standard technique to deal 
with assumptions that are made by default because they are usually — but not 
necessarily always — correct. Yet the author illustrated that making the right 
assumptions in the context of the Qualification Problem is trickier than usual. 
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Roughly speaking, this is a consequence of the dynamics inherent in action 
theories, which implies that abnormalities may naturally arise for reasons of 
causality. If John understood correctly, the crucial point was the following. 
Suppose, for instance, Elaser had been told in advance that somebody had 
planned to add a brick to the newspaper package. Then it would have been 
reasonable for the robot to assume that this action had been successful and, 
hence, lifting the package would have been disqualified thereafter. But the 
application of the aforementioned standard technique would equally well sug- 
gest another course of events, namely, that introducing the brick is impossible 
in the first place due to some mysterious unspecific reason. Therefore any so- 
lution to the Qualification Problem, the author argued, requires an account 
of abnormal preconditions of actions which are brought about as side effects 
of previous actions. Side effects being nothing else than indirect effects, the 
preceding solution to the Ramification Problem turned out to furnish a ready 
fundamental for a solution to the Qualification Problem. The book continued 
with showing how to seek explanations in case an action surprisingly fails at 
some point. Even the rare case of inexplicable disqualifications, “miraculous” 
they were called, had been considered. Like the previous one, the chapter on 
the Qualification Problem concluded with an axiomatization in formal logic 
of the entire action theory. 

Before John called an assembly of the project group in order to announce 
that he had found the solution to their new problem, he read through the 
final, comparably short chapter. The author expanded the connection be- 
tween the Ramification and Qualification Problem even further. Just like 
actions may turn out unqualified for some abnormal reason, so the argument 
went, there may happen exceptions to the occurrence of indirect effects. This 
time the existing solution to the Qualification Problem in turn furnished a 
ready approach to this generalization of the Ramification Problem. 

Finally, John thought contentedly daydreaming in his bed, with the in- 
valuable assistance of the Book they had brought the project Elaser to a 
successful end. After having redesigned the robot so as to being capable of 
coping with the Qualification Problem, they continued making all kinds of 
tests for weeks. Elaser had passed them all effortlessly. Thinking of that, 
John happily nodded off again, with a hardly noticeable smile on his face. He 
dreamed about Elaser strolling around campus when he suddenly bumped 
into a famous philosopher, who straight away started arguing that the robot 
lacks conscious understanding of anything he is doing. When Elaser de- 
manded a precise definition of what he meant by conscious understanding, 
the philosopher finally defined it as being a property only carbon-based brains 
can possess. I can live with having no understanding of that kind, Elaser 
thought walking off with a slight shake of his head, leaving the philosopher. 

This, however, is not the story of Elaser. Nor is this the story of John. 

This is the Book. 
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1. Foundations of Action Theories 



1.1 Purpose 

Before we discuss the purpose of action theories, let us try to provide a 
suitable and compact definition of the subject in question: 

An action theory consists of a formal language that allows adequate spec- 
ifications of action domains and scenarios, and it tells us precisely what 

conclusions can be drawn from these specifications. 

Of course this informal definition cannot be appreciated without further clar- 
ification of the crucial terms used therein. To begin with, by “action domain” 
we mean any aspect of the world worth formalizing in which the execution 
of actions plays a central role. This is the case, for instance, if one wants 
to model an agent that interacts with its environment, i.e., the part of the 
world it is able to affect. Most importantly, an autonomous agent needs pre- 
cise knowledge as to the effects of its actions in order to act purpose-oriented 
and so to achieve pre-determined goals. The latter requires to draw the right 
conclusions from this knowledge in view of particular situations, in which the 
agent has acquired partial knowledge about the current state of the environ- 
ment and has a certain goal in mind. 

By “action scenario” we mean exactly these particular situations: Given is 
some information as to the current, the past and/or even a future or counter- 
factual state of the world. The task then is to appropriately interpret these 
observations so that the right conclusions can be drawn. Action theories 
provide this. They include a formal entailment operation that determines the 
set of conclusions which a scenario within an action domain allows. 

Both a general action domain and a particular scenario are specified using 
the formal language underlying an action theory. This language determines 
the expressiveness of the theory. There exist action theories, for instance, 
that support the specification of actions with non-deterministic effects (such 
as rolling a dice), others don’t. Whether or not a certain action theory is 
suitable for a particular application depends on the expressiveness required. 
If, for instance, it suffices to consider discrete state transitions only, then 
there is no need to employ an action theory designed for modeling continuous 
change. The two aspects discussed in this book, however, viz. indirect effects 
and qualifications of actions, are arguably fundamental issues and need to be 
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incorporated by any action theory meant to address other than artificially 
simplified domains. 

Finally, the innocent word “adequate,” which the reader may even have 
overlooked in the above definition, is probably the most crucial aspect. It 
means that the specification of actions and their effects shall be as natural 
as possible. For example, it would not only be inconvenient but clearly most 
unnatural to explicitly state the result of executing an action in every possi- 
ble situation. Rather one wants to specify that, say, carrying the newspaper 
always causes it to change its location — no matter what color it is or whose 
party the current President belongs to etc. Likewise, one would want to avoid 
re-specifying the effects of transporting an object in case it contains (or is 
underneath or attached to etc.) another object. Rather the fact that this 
additional object also changes its location should be inferable from general 
knowledge of the world. In providing all this, action theories always include 
a more or less implicit general notion of time, change, and causality. This 
indicates that the adequacy requirement is what makes action theories so 
special — and it is what this book is all about. 

Action theories have much in common with logic. They are based on a for- 
mal language and they include an entailment relation among the expressions 
in this language. This relation determines the way conclusions are drawn from 
specifications. Yet entailment in action theories is somewhat different from en- 
tailment in so-called general purpose logics, such as classical first-order logic, 
say. The reason is that the entailment relation reflects the special notions 
of change and causality inherent in action theories. This usually makes the 
formal definition of how to draw conclusions much more complex compared 
to the majority of general purpose logics. It also means that any enrichment 
of the ontology of an action theory necessitates changes in the definition of 
entailment. 

Both the fair complexity and frequent changes of the notion of entailment 
in action theories constitute an important drawback in view of automating 
reasoning. This favors general purpose logics as means to this end. Research 
in automated deduction in first-order logic, for instance, has made noticeable 
progress in the past decades. It might be unwise not to exploit this devel- 
opment for automating reasoning in action domains. Fortunately this can be 
achieved without losing the major advantage of action theories, namely, their 
naturalness when it comes to formalizing domains and scenarios. What needs 
to be done towards this end is to axiomatize in, for instance, first-order logic 
the characteristics of an action theory. In other words, the implicit theory 
of change and causality is to be made explicit. It became common to call 
“foundational axioms” the resulting encoding that characterizes a particular 
action theory. Additional axioms then represent knowledge about a specific 
action domain and scenario. Together they provide a (hopefully) suitable 
encoding which allows to draw all conclusions suggested by the underlying 
action theory but by means of a general purpose logic. Notice that the ques- 
tion whether such an encoding is suitable is a precise mathematical problem 
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and is solved by proving whether or not the drawable conclusions are always 
identical with what is entailed in terms of the action theory. This providing a 
justification for an axiomatization is a major purpose of action theories and 
the striving force for developing them. More to this in the brief historical 
account in Section 1.3. 

In this book, we concern ourselves with two most fundamental challenges 
for action theories, namely, to account for indirect effects of actions (the 
Ramification Problem) and to treat unlikely but not impossible action dis- 
qualifications in a natural way (the Qualification Problem). We develop a 
uniform action theory that provides solutions to both problems. We further 
present a provably correct axiomatization of our theory by means of a gen- 
eral purpose logic, viz. classical first-order logic in the first part, i.e., for the 
Ramification Problem only, and classical logic augmented by a nonmonotonic 
feature for our entire theory.^ 

A Word on the Notation 

The only expertise needed to understand all parts of the book is some basic 
knowledge of classical logic. But even this is not required except for Sec- 
tions 2.9 and 3.6, where our action theory is axiomatized in formal logic. 

We use the standard logical connectives, stated in order of decreasing 
priority, -i (negation), A (conjunction), V (disjunction), D (material im- 
plication), = (equivalence), V (universal quantification), and 3 (existential 
quantification). Both predicate symbols and constants start with a capital 
letter whereas function symbols and variables are in lower case, sometimes 
with sub- or superscripts. Free variables in formulas are assumed universally 
quantified unless indicated otherwise. Special symbols used in example do- 
mains are printed in typewriter style, like turkey, potato, or tank-empty. 
We further use the basic set operations and relations U (union); fl (inter- 
section); \ (difference); G (membership); C, D (sub- and superset); and 

2 (proper sub- and superset). All other symbols used in this book are 
explained at the time of their first appearance. 



1.2 A Basic Action Theory 

Having said much about action theories in general, we now get more for- 
mal and introduce a particular, elementary action theory containing all of 
the necessary basic ingredients. First of all, any action theory must provide 
means for describing states. A state is a snapshot of the part of the world 
being modeled at a particular instant of time. State descriptions need to be 
composed of atomic propositions. This is vital for our purposes since actions 

^ We use Default Logic, to be precise; see Section 3.6. The reason for pure classical 
logic being insufficient in view of the Qualihcation Problem will be given in due 
course. 
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typically affect only a small fraction of the environment — and in order to 
concentrate on this fraction when specifying the effects of an action we need 
access to it. Otherwise, i.e., if states are represented as abstract objects with- 
out bearing an internal structure (as it is typical for automata theory, for 
instance), the impact of an action could only be specified by a complete state 
transition table. This would violate the most fundamental requirement for 
adequacy. 

Atomic propositions represent properties, or in general relations among 
entities, which do or do not hold in a particular state. The truth-value of any 
such proposition may change in the course of time as a consequence of a state 
transition. Due to this dynamic characteristics the term “fluent” has been 
established as a name for these propositions. Thus a state is characterized by 
saying which of the various fluents are true and which are false in this state. 

Definition 1.2.1. Let £ he a finite set of symbols called entities. Let T 
denote a finite set of symbols called fluent names, each of which is associated 
with a natural number (possibly zero) called arity. A fluent is an expression 
/(ei, . . . , e„) where f € if is of arity n and ei, . . . , e„ € £. A fluent literal 
is a fluent or its negation, denoted by ~'/(ei, . . . , e„) . A set of fluent literals 
is inconsistent if it contains a fluent along with its negation, otherwise it is 
consistent. A state is a maximal consistent set of fluent literals. m 

Before we continue, let us illustrate these concepts with a small example. 
At several places in this book we will model the behavior of electric circuits. 
Suppose a particular circuit consists of two binary switches, a light bulb, 
and a battery. The various states of this system may be described using the 
two entities £ = {s;]^,S 2 } representing the two switches, along with the 
unary fluent up denoting the position of its argument, and the nullary fluent 
light denoting the state of the bulb.^ Then both up(s 2 ) and -ilight, say, 
are fluent literals, and {-'up(s;]^), up(s 2 ), “■light} is a maximal consistent 
set of fluent literals, hence a state. 

The reader may have noticed that formally any combination of truth- 
values constitutes a state. Later in this book, in Chapter 2, we will provide 
means to distinguish states that cannot occur due to domain-specific depen- 
dencies among fluents. For convenience, we introduce the following notational 
conventions: If £ is a fluent literal, then by ||£|| we denote its affirmative com- 
ponent, that is, ||/(e)|| = ||“'/(e)|| = /(e) where f £ IF and e is a sequence 
of n entities with n being the arity of /. This notation extends to sets of flu- 
ent literals S as follows: US'!! = {||f|| : £ € 5} . E.g., whenever S' is a state, 
then ||S|| is the set of all fluents. If £ is a negative fluent literal, then -■£ 
should be interpreted as p||. In other words, “■“■/(e) = /(e). Finally, if S 
is a set of fluent literals, then by -•S we denote the set {-■£ : £ G Sj. E.g., 
a set S of fluent literals is inconsistent iff S fl -iS yf {}. 

The second fundamental notion in any action theory are the actions them- 
selves. Actions cause state transitions when being performed. As indicated 

^ The curious reader may take a peek at Fig. 2.3 on page 22. 
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above, a fundamental assumption concerning actions is that they always af- 
fect just a small fraction of an entire state. The adequacy requirement dictates 
that action specifications concentrate on this fraction only. Recall that the 
decision to split states into fluents has been made with exactly that purpose 
in mind. Describing the effect of an action thus amounts to specifying which 
fluents change their truth-value when the action is being executed. For ex- 
ample, switching off the second switch, S2, always has the effect that the 
fluent up(s2) becomes false, regardless of the values of all other fluents. We 
shall formally write this as 

switch-of f-S2 transforms {up(s2)} into {->up(s2)} ( 1 - 1 ) 

This means that action switch-off-S2 can be executed in any state S 
which satisfies up(s2) G S, and the resulting state is obtained by replacing 
up(s2) by -iup(s2) in S. So if, for instance, we switched off S2 in the 
above state, {-■up(s]^), up(s2), “■light}, then the resulting state would be 

{^up(si),-.up(s2),-'light}.3 

Specifying our example action was particularly easy because its effect is 
always the same. It is not hard to imagine more complex actions that produce 
different effects, depending on the state in which they are executed. Suppose 
we want to describe what happens if we toggle the second switch. Obviously 
there are two cases to be considered. First, if the switch is currently up it 
is down afterwards, and, second, if it is already down in the beginning, then 
toggling it causes it to be in the upper position. Actions of this more complex 
kind are described by two or more “laws” of the form of ( 1 . 1 ). Toggling the 
second switch, for instance, can be specified through 

toggle-S2 transforms {up(s2)| into {^up(s2)| 

toggle-S2 transforms {-■up(s2)} into {up(s2)| 

Notice that since states are maximal consistent sets, they always contain 
either up(s2) or else -■up(s2), at least in our example domain. In any case, 
therefore, exactly one of the two action laws for toggle-S2 is applicable and 
yields the expected result, namely, that the switch alters its position. 

It is often convenient to specify an action simultaneously for a whole 
collection of similar entities. To this end, action laws may contain variables 
to be substituted by entities. For instance, the two specifications 

toggle(x) transforms {up(a;)} into {-■up(a:)} 

toggle(a;) transforms {-■up(a;)} into |up(a;)} 

describe what toggling means in general. To be more precise, let x denote 

a finite sequence of variables chosen from a given denumerable set V. If x 
contains the variables that occur in some expression then this is written 

® The reader is right who wonders whether this really is all that is to be said as 
to the effect of turning off a switch. More to this in Chapter 2. 
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^[t]. Let X = Xi, . . . ,Xn, then a ground instance of an expression is 

obtained by applying a substitution 9 = {x\ ^ ei, . . . ,Xn ^ e„} to where 
G S are entities.^ Suppose e = ei,...,e„, then ^\x]9 is also 
written ^[e]. For example, the ground instance 

Of[s 2 ] = toggle(s 2 ) transforms {up(s 2 )} into {->up(s 2 )} 

can be obtained from a[x] = toggle(a:) transforms {up(a;)} into {->up(a;)} 
by substituting {x S 2 }.® 

This finally leads us to the formal definition of actions and their spec- 
ification. We call fluent expressions any and its negation 

->f{ti, . . . ,tn) where / is a fluent name of arity n > 0 and each U is 
either an entity or a variable (1 < z < n). 

Definition 1.2.2. Let A be a finite set of action names, each of which is 
associated with a natural number (possibly zero) called arity. An action is a 
ground term a(e) where a G A is of arity equal to the length of e. An action 
law is of the form 



a(x) transforms C[x] into E[x] 

where a G A is of arity equal to the length of x and where C\x] and E\x] 
are sets of fluent expressions which satisfy the following. Both C[e] and E\e], 
for any e, are consistent and ||C'[e]|| = ||if[e]||, that is, C and E always 
refer to the same fluents. If S is a state, then a ground instance a\e] of 
an action law a\x] = a(x) transforms C[x] into E\x] is applicable to S iff 
C[e] C S. The application of a\e] to S yields (S'\C'[e]) \J E\e]; the latter 
is called preliminary successor state of S and a(e).® ■ 

The reader may notice that S being a state, C[e] and E[e] being consistent, 
and ||C'[e]|| = |lif[e]|| guarantee that (S'\C'[e]) U if [e] is a state, too. 

Thus far we have only seen actions which determine at most one prelim- 
inary successor state when being executed. We needed two different action 
laws to specify the effects of toggle(a;), that much is true, but the two con- 
ditions were mutually exclusive so that the two laws are never applicable at 
the same time. Yet Definition 1.2.2 does not exclude the existence of two 
(or more) simultaneously applicable laws for one and the same action. This 
feature is required if one wants to describe actions with indeterminate effects, 
so-called non-deterministic actions. Tossing a coin is a simple action which 

^ Application of a substitution 6 = {xi 1 -^ ei, . . . , x„ Gn} to an expression ^ 
means the replacement, in of any occurrence of variable Xi by entity d 
(1 < i < n). The resulting expression is denoted by ^6. 

® Had we considered entities different from switches and for which the notion of 
toggling would make no sense, such as, say, potatoes (see Chapter 3), then we 
should introduce a unary fluent switch denoting whether or not its argument 
is a switch. This fluent should then be used as an additional condition within 
the two action laws for toggle)®). 

® The reason for calling “preliminary” this state will be revealed in Chapter 2. 
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obviously belongs to that category. But to stick to our current domain, sup- 
pose it is totally dark so that it is impossible to tell the two switches apart. 
Nonetheless we want to switch up one of them (knowing both are currently 
down). Putting this plan into execution, there are two possible outcomes: We 
either hit the first or else the second switch. This may be formalized by the 
two action laws 

switch-one-up transforms {-'up(s;]^)} into {up(s]^)} 

j 

switch-one-up transforms {-■up(s 2 )} into {up(s 2 )} 

According to Definition 1.2.2 these two laws are simultaneously applicable 
to the state {-'up(s;L), -iup(s 2 ), -■light}. Their respective application de- 
termines two different successor states, viz. {up(s -iup(s 2 ), -■light} and 
{-■up(s]^), up(s 2 ), -■light}. In any particular situation of course only one of 
these possibilities will actually occur. Which one this will be, however, cannot 
be predicted, at least not on the basis of the (restricted) knowledge about 
the domain. This is what makes the action in question non-deterministic.^ 
The concept of action laws defining how the execution of actions affects 
the particles of state descriptions, viz. the fluents, provides us with a basic 
formalism to specify action domains. The semantics of these specifications is 
given by complete state transition models. 

Definition 1.2.3. A basic action domain T> is a 4 -tuple {S,T,AtC) where 
£ is a set of entities, T a set affluent names, A a set of action names, and 
L is a set of action laws. The transition model of T> is a total mapping S 
from state-action pairs into (possibly empty) sets of states such that S' G 
S{S,a) iff S' is a preliminary successor of S and a. ■ 

Any (basic) action domain provides general, i.e., situation-independent knowl- 
edge as to the impact of performing actions. Exploiting this knowledge when 
drawing conclusions about particular scenarios within a domain is our next 
concern. 

A scenario is given by information as to particular developments of the 
part of the world which has been specified as action domain. In most cases, 
this information is incomplete in that only for some fluents at some stages the 
truth- values are known. The general task then is to draw the right conclusions 
as to the truth-values of other fluents at other stages. Of course this requires 
knowledge of the general effects of actions, given by the formal specification 
of the underlying action domain. 

As a very simple example, suppose we observe that after switch Sj^ has 
been toggled it is in the upper position. Then it is reasonable to conclude 
that the switch was down beforehand. This should follow from our knowl- 
edge as to the general effects of toggling switches, which is provided by 
the two action laws defined in (1.2). Formally, we will use expressions like 

^ Although non-determinism is not amongst the basic requirements for action 
theories, it will be vital for both the Ramification and Qualification Problem. 




1 . Foundations of Action Theories 



up(sj^) after [toggle(s]^)] to denote what we call observations. A scenario 
can then be modeled as a collection of observations, and drawing the right 
conclusions amounts to deciding which observations follow from the given 
ones. E.g., -iup(s;l^) after [] could be one such conclusion. 

Observations may be more complex than the two examples just men- 
tioned. For instance, up(s;[) = up(s2) after [] says that both switches are 
in the same position — though it is not known what position they share. A rea- 
sonable conclusion here would be, say, -i(up(s;L) = up(s2)) after [toggle(s2)], 
that is, toggling the second switch results in both switches being in different 
positions. Another variant is to ask hypothetical questions like the following. 
Suppose the result of toggling S2 would be that all switches have the same 
position. Suppose further that toggling S;]^ would bring this switch down. 
What, then, follows as for the result of toggling both Sj^ and S2? Clearly, 
the assumptions imply that initially is up and (hence) S2 is down. From 
this we conclude that the right answer to the question is that switch is 
down and switch S2 is up. More formally, we say that the two observations 

Va:.up(a;) V Vx. ->up(a;) after [toggle(s2)] ( 14 ) 

->up(s;l) after [toggle(s;[)] 

entozZ the observation -iup(s^) A up(s2) after [toggle(s j^), toggle(s2)] . 
Again, to reiterate the obvious, the conclusion relies on the correct definition 
of the action laws. In the remainder of this section, we show how conclusions 
of this kind are obtained on a formal basis. To this end, we first need a 
precise definition of observations. This in turn requires the formal notion of 
a so-called fluent formula, such as -i(up(s;L) = up(s2)) or Va;.-'up(x). 

Definition 1.2.4. Given sets of entities, fluent names, and variables, the set 
o/ fluent formulas is inductively defined as follows: Each fluent expression and 
T (tautology) and T (contradiction) are fluent formulas, and if F and G 
are fluent formulas so are ->F, FAG, F V G, F D G, F = G, 3x. F, 
and 'ix.F (where x is a variable). A closed formula is a fluent formula 
without free variables, that is, where variables only occur in the scope of some 
quantifier using this variable.^ m 

Definition 1.2.5. Let £ , T , and A be sets of entities, fluent names, and 
action names, respectively. An observation is of the form 

F after [ai(ei), . . . , a„(e„)] 

where F is a closed fluent formula and each of ai(ei), . . . , a„(e„) is an 
action (n> 0). m 

Definition 1.2.6. A basic action scenario is a pair {0,T>) where V is a 
basic action domain and O is a set of observations (based on the entities 
and fluent and action names in T>). ■ 

® The scope of the quantifiers in 3 x. F and \/x. F is defined as the sub- 
formula F. 
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Prior to defining what conclusions an action scenario allows, we need to 
clarify under which circumstances a particular observation can be said to be 
true. This obviously depends on what state supposedly results from executing 
the action sequence in question. If that state is fixed, then deciding whether 
the fluent formula itself holds is straightforward, following the standard in- 
terpretation of the logical connectives. 

Definition 1.2.7. Let £ and T he sets of entities and fluent names, re- 
spectively, and let S he a state. The notion of a closed formula being true 
(resp. false j in S is inductively defined as follows : 

1. T is true and _L is false in S; 

2. a fluent literal I is true in S iff £ € S ; 

3. -<F is true in S iff F is false in S ; 

4- F A G is true in S iff F and G are true in S; 

5. F V G is true in S iff either F or G is true in S (or both); 

6. F D G is true in S iff F is false in S or G is true in S (or both); 

1. F = G is true in S iff F and G are true in S , or else F and G are 

false in S; 

8. 3x. F is true in S iff there exists some e € £ such that F{x i— e} is 

true in S; 

9. 'ix.F is true in S iff for each e € £, F{x i— e} is true in S. 

Here, F{x i— e} denotes the fluent formula resulting from replacing in F 
all free occurrences of variable x by entity e . ■ 

As an example consider the formula 3a;. ->up(a;) D -ilight, which is true 
in the state {-■up(s;]^), up(s 2 ), “■light} (since -ilight is true) and also in 
{up(sj^), up(s 2 ), light} (since 3a;.-'up(a;) is false), but the formula is false 
in, e.g., {up(si),-.up(s 2 ), light}. 

As indicated, the observations that describe a scenario usually provide 
only incomplete information as to the entire state of affairs. This is espe- 
cially true if non-deterministic actions are considered because then complete 
information means to know the actual result of any possible sequence of non- 
deterministic actions. Thus the normal case is that there is more than just one 
unique state of affairs that fits a scenario description. Following a standard 
terminology in logic, we call any possible state of affairs an interpretation, 
and if the latter matches a scenario description it is called a model thereof. 

We have already seen examples involving reasoning about hypothetical 
developments of the world. An interpretation therefore must not just tell 
us exactly what happens during the execution of one particular sequence of 
actions. Rather it needs to provide this information as to any possible course 
of events. Of course we assume the world always evolves according to the 
underlying action laws. That is to say, whenever some state S results from 
performing some action sequence, and some further action a is executed, 
then the result should be a successor of S and a. 
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Definition 1.2.8. Let {0,T>) he a basic action scenario. An interpretation 
for {0,T>) is a pair {S,Res) where S is the transition model of V and 
Res is a partial function which maps finite (possibly empty) action sequences 
to states and which satisfies the following: 

1. i?es([]) is defined. 

2. For any sequence a* = [oi, . . . , ak-i,ak] of actions (k > 0), 

a) Res{a*) is defined if and only if Res{[a\, . . . ,ak-i\) is defined and 
S{Res{[ai, . . . , Ofe-i]), Uk) is not empty, and 

b) Res{a*) € S{Res{[ai, . . . ,ak-i]),ak). , 

The second component of an interpretation may be depicted as a tree 
(of infinite depth) whose root node contains the initial state, i?es([]), and 
whose branches each characterize the supposed evolution of the world under 
a particular sequence of actions; see Fig. 1.1. 

Interpretations always tell us the exact result of performing any possible 
action sequence. It is therefore straightforward to determine whether an ob- 
servation is true with regard to a particular interpretation: First of all, it can 
be true only if the state is defined which results from performing the sequence 
of actions in question. If, moreover, the fluent formula in question is true in 
that state, then the observation itself is true.® 

Definition 1.2.9. Let {E, Res) he an interpretation for a basic action sce- 
nario {0,T>). An observation F after [ai,...,a„] (n > Oj is true in Res 
iff Res{[ai , . . . , o„]) is defined and F is true in Res{[ai , . . . , o„]). ■ 

For the purpose of illustration, the reader may verify that the two obser- 
vations (1.4) from above are true in the interpretation depicted in Fig. 1.1, 
but not the observation up(s;j^) after [toggle(s;]^), switch-one-up] , nor the 
observation -ilight after [toggle(s 2 ), switch-one-up] because the result 
of the latter action sequence is undefined. 

Among all possible interpretations for the underlying action domain we 
are especially interested in those which satisfy all observations of a specific 
scenario. As indicated, these are called the models of the scenario. Models 
help us define what can be concluded from a scenario description, namely, 
any observation which is true in all models of a domain. 

Definition 1.2.10. Let {0,T>) he a basic action scenario. A model of this 
scenario is an interpretation {E,Res) such that each o € O is true in Res. 
An observation o is entailed, written O \=xi o, iff o is true in all models 
of{0,V). m 

This completes the introduction to our basic action theory. Now that we 
have put together all necessary formal concepts, let us give a fully formalized 

® An observation F after a* is considered false in an interpretation Res when- 
ever Res (a*) is undefined (the alternative would be to consider it undefined, 
too), because we take the observation as implicitly asserting that the action 
sequence a* be executable. 
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Figure 1.1. Each interpretation suggests possible evolutions of the system’s 
state. This may be depicted as tree whose nodes are states and whose 
edges are labeled with actions. In the example, the state resulting from per- 
forming toggle(s 2 ) admits only two child nodes, as opposed to, say, the 
root node admitting three of them. The reason is that no action law for 
switch-one-up (c.f. (1.3)) is applicable to the state {up(s]^), up(s 2 ), flight} . 
Thus 7?es([toggle(s2), switch-one-up]) is not defined in this interpretation. No- 
tice further that when constrncting an interpretation one has to select among the 
alternative results whenever a non-deterministic action is performed; in the exam- 
ple the result of [toggle(s]^), switch-one-up] , for instance, has been determined as 
{-iup(s]^), up(s 2 ), -.light} — we would have a different interpretation had we made 
the equally possible choice {up(s]^), -.up(s 2 ), flight} . 



version of our running example. Below and from now onwards, by we 
indicate that ^ is of arity k. 

Example 1.2.1. Let £ = {s;[,S 2 } be the set of entities, T = {up^,light°| 
the set of fluent names, and A = {toggle^, switch-one-up^} the set of 
action names. Furthermore, let C consist of the action laws 



toggle(a;) 


transforms 


{up(a;)} 


into 


{-up(x)} 


toggle(a;) 


transforms 


{-■up(x)} 


into 


{up(x)| 


swit ch-one-up 


transforms 


{-.up(si)} 


into 


{up(sj^)} 


swit ch-one-up 


transforms 


{^up(s2)| 


into 


{up(s2)} 
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then T> — {£,T,AtC) constitutes an action domain. An excerpt of its tran- 
sition model E is shown in Table 1.1. Let O consist of the observations 

Vx.up(x) V Vx. -'up(a;) after [toggle(s2)] 

->up(s;l^) after [toggle(s;L)] 

then {0,T>) is an action scenario. Each of its models (E,Res) satisfies one 
of 

i?es([toggle(si),toggle(s2)]) = {-■up(si), up(s2), -■light} 

(the model depicted in Fig. 1.1 does this, for instance) or 

i?es([toggle(si),toggle(s2)]) = {^up(si), up(s2), light} 

In both resulting states ->up(s]^) and up(s2) hold. Thus we obtain 
O \=v ^up(si) A up(s2) after [toggle(si), toggle(s2)] 

■ 

The, admittedly simple, action theory we arrived at contains all of the 
basics requested at the beginning: The theory provides a language for for- 
malizing both general action domains and specific scenarios. It also includes 
an entailment relation that tells us precisely what can be concluded from 



Table 1.1. The transition model assigns each pair of state and action a set of 
preliminary successor states. 



s 


a 


E{S, a) 


{-.up(si), -■up(s2), -■light} 


toggle(si) 


{{iip(si), -’up(s2), -■light}} 


toggle(s2) 


{{-.up(si),up(s2), -■light}} 


switch-one-up 


1 {up(si),-'up(s2), -■light}, 1 
({-■up(si),up(s2), -■light} j 


{up(si), -■up(s2), -■light} 


toggle(si) 


{{-■up(si), -■up(s2), -■light}} 


toggle(s2) 


{{iip(si)>iip(s2). flight}} 


switch-one-up 


{{iip(si),up(s2), -■light}} 




{up(si),up(s2), light} 


toggle(si) 


{{-■up(si),up(s2), light}} 


toggle(s2) 


{{'ip(si), -■up(s2), light}} 


switch-one-up 


{} 
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a scenario description. Our theory satisfies the most fundamental adequacy 
requirement, namely, that action laws concentrate on the part of the world 
which is affected when the action is performed. On the other hand, the ba- 
sic theory makes two very strong assumptions. First, it requires any action 
law to contain the entire effect of the action it describes, that is, not only 
the direct but also all possible indirect effects. This is a consequence of the 
assumption that no fluent changes which is not mentioned in an action law. 
Second, the theory asserts that an action is guaranteed to succeed once all 
of the specified conditions are true in a state. Both assumptions, their inad- 
equacy, and a much more sophisticated action theory which overcomes these 
assumptions are subject of the remaining chapters. 



1.3 Bibliographic Remarks 

Formal action theories have not been developed until very recently. The un- 
derlying idea of formalizing reasoning about actions and planning, however, is 
much older. In fact, automating the ability of common sense to reason about 
actions and their effects was among the very first issues raised in Artificial 
Intelligence research [75]. There the belief was advocated that multifarious 
intelligent behavior relies on the ability to maintain a mental model of the 
world and to draw the right conclusions about observations and intentions. 

The historically first formal approach to reasoning about actions was a 
pure first-order encoding of some example action domains and scenarios. 
This encoding introduced the so-called Situation Calculus paradigm, which 
satisfies the fundamental requirement for adequacy in that actions are spec- 
ified by their effects. In so doing, the Situation Calculus brings along the 
Frame Problem [74], which denotes the problem both of how to represent, 
in logic, the general assumption that any non-affected fluent keeps its truth- 
value when an action is performed, and of how to reason efficiently with this 
representation.^*^ 

While the earliest Situation Calculus-based encodings were intuitively 
plausible, the solutions to the Frame Problem suggested and employed in 
this context were cumbersome and inefficient [43] . Most of subsequent work 
in this field was therefore devoted to finding better solutions to this prob- 
lem. It is beyond all question that conceivable progress had been made in 
this regard, but it turned out that a considerable price has been paid to- 
wards this end: Axiomatizations of action scenarios seem to become lesser 

Philosophers have dealt with problems related to actions and their effects for 
much longer, in the context of causality. Interestingly enough, however, they 
seem never to have struggled with the adequacy of action specifications. This 
is indicated by the fact that the Frame Problem has not been encountered 
until AI researchers started to investigate action formalisms [22]. Even later 
formal approaches to the causality phenomenon, such as [120, 1], presuppose 
exhaustive state transition functions, 
more to this in Section 2.10 
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plausible the better they tackle the Frame Problem. Triggered by an increas- 
ing number of action formalisms being erroneous in allowing unintended and 
counter-intuitive conclusions, the reliability of action encodings became a 
new major problem. The by far most popular case is the approach proposed 
in [78], which uses so-called circumscription [77], a particular extension to 
classical first-order logic, to address the Frame Problem. Following common 
practice, this approach was validated merely by example. Soon, however, a 
serious flaw of this method was revealed with the help of yet another simple 
example^^ which is treated against the intuition [49].^^ 

The increasing difficulties with assessing the correctness of action for- 
malisms just by appealing to the intuition led to the insight that formal 
validation methods are needed in order to guarantee reliability. The first step 
in this direction was a generalization of an intuitively correct encoding of an 
example domain so that a whole, well-defined class of domains is covered [66]. 
This generalization was accompanied by a formal proof that the encoding of 
any such domain will yield the same results as obtained in the example, hence 
will inherit the intuitive correctness. Although this result was established for 
a very restricted class of domains (compared, e.g., to what is expressible in 
our basic action theory), this was the first time that an action formalism 
itself was subject to a formal proof of its correctness. 

The first action theory that deserves this name in being truly independent 
of a specific axiomatization was the Action Description Language, abbrevi- 
ated A [34].^^ This formalism and our basic action theory have similar ex- 
pressiveness except that A is restricted to nullary (i.e., propositional) fluents 
and does not support non-deterministic actions. In fact, many of our notions 
and notations were borrowed from this first approach. The Action Descrip- 
tion Language was first employed to validate an encoding of action domains 
based on Situation Calculus and using so-called extended logic programs [34] . 
Later, A has been employed for the evaluation of action formalisms such as 
Situation Calculus based on abductive logic programs [24, 20] , Situation Cal- 
culus with so-called successor state axioms (see Section 2.10) and Situation 
Calculus with circumscription, respectively [58], and Fluent Calculus (see 
Section 2.9) based on so-called equational logic programs [108], just to men- 
tion a few. Any of these results verifies correctness of the respective action 
formalisms if applied to domains expressible in A. In order to obtain more 
general assessment results, the original Action Description Language has been 
extended into various directions, among which we again mention just a few. 

This famous example is known as the Yale Shooting scenario; see also Sec- 
tions 3.7 and 2.8. 

Moreover, the improvement to the original formulation proposed in [3], again 
motivated solely by examples, has been proved (by a counter-example) re- 
stricted to a stronger extent than originally claimed [59]. 

First published as [33]. The reader should not be misled by the name; A is not 
just a language but comes along with a formal semantics of how expressions in 
this language are to be interpreted in terms of state transitions, models, and 
entailment. 
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Non-propositional fluents were first introduced in [24]. The dialect Ac al- 
lows to represent and reason about the effects of concurrently performed 
actions [4]. Non-deterministic actions are supported in Ajq, and the lan- 
guage Ancc extends the latter and combines it with Ac [12]. In ATZ, 
too, non-deterministic actions can be formalized and, moreover, it includes 
a basic solution to the problem of indirect effects of actions [57, 40]. The 
extension £q supports incomplete specification of the action sequence that 
actually occurs in a scenario [6] . Information gathering actions can be repre- 
sented in the language Ak [70] . The language £ of [55] is based on a linear 
time structure and allows to represent narratives of events. A good general 
introduction to the Action Description Language and several dialects is [35]. 

The second major class of action theories has its roots in the language 
and semantics developed in [89]. A distinctive feature of this framework, 
which is commonly called “Features-and-Fluents,” is that it allows to rea- 
son about the duration of actions. A thorough and fine-grained hierarchy of 
sub-classes with restricted expressiveness allows precise assessment results 
for different action formalisms. Major evaluation results are reported in [90]. 
Several extensions to the original framework have been developed, e.g., con- 
currency of actions [122], actions with indirect effects [91, 47, 93], and goal- 
orientedness [95]. The reference article [94] offers a general introduction to 
this line of research. 

A first formal comparison between Action Description Language A and 
“Features-and-Fluents” was established in [107], where both theories are 
proved equivalent for a particular class of domains. Of course many more for- 
malisms exist which arguably deserve being called action theories. The Action 
Description Language and the “Features-and-Fluents” framework, however, 
are the only ones that found broader dissemination in two respects: They 
have been employed for the validation of a variety of action formalisms, and 
they have been used as a uniform basis for dealing with a variety of onto- 
logical aspects. Insofar as these and other more specialized action theories 
address the problems we are concerned with in the following chapters, they 
will be described and discussed in detail in due course. 
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2.1 Indirect Effects of Actions 

An invaluable advantage of action laws is that they allow to describe actions 
by their effects rather than by an exhaustive state transition table. How- 
ever, the sole use of action laws quickly becomes unmanageable in complex 
domains, too. For action laws as they stand are supposed to be complete 
in that they specify the entire effect of an action. Yet although there are 
good reasons to assume that an action causes only a small number of direct 
changes, these in turn may initiate a long chain of indirect effects. Recall the 
action of toggling a switch, which, in the first place, causes nothing but a 
change of the switch’s position. However, the switch may be part of an elec- 
tric circuit so that, say, some light bulb is turned off as a side effect, which 
in turn may cause someone to hurt himself in a suddenly darkened room 
by running against a chair that, as a consequence, falls into a television set 
whose implosion activates the fire alarm and so on and so forth. ^ 

Let us state the problem more precisely. Suppose we perform the action 
of toggling the switch in the simple electric circuit depicted in Fig. 2.1. In the 
first place, the action changes nothing but the position of the switch, which 
thus is the direct effect. Obviously, however, a now closed switch means that 
the light goes on.^ This is an indirect effect of our action. This additional 
effect is a consequence of the general relation between the position of the 
switch and the state of the bulb, namely, light is on if and only if the switch 
is in the upper position. This relation is formally described by the fluent 
formula light = up(s]^). Fluent formulas of this kind, which are supposed 

^ A crucial question in this context concerns the distinction between indirect 
effects occurring during a single world’s state transition step and those which 
deserve separate state transitions (also called “delayed” effects). E.g., the above 
may not only be described as “the fire alarm is activated in the successor state 
after having closed the switch,” but also as, say, “the chair is falling in the 
successor state (and presumably hits the television set during the next state 
transition).” As a reasonable, albeit informal, guidance we suggest a single state 
transition should summarize what happens until some agent has the possibility 
to intervene by acting again (stopping the chair from falling, for instance). See 
also Section 2.7 for a discussion on indirect vs. delayed effects. 

^ We tacitly assume that battery and bulb are in working order; more to this in 
Chapter 4. 
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^up(si) 




“ilight 



Figure 2.1. An electric circuit consisting of a battery, a switch, and a light bulb, 
which lights up if and only if the switch is closed. The two dynamic components 
are described by two fluents, both of which are false in the current state. 



to be true in any state, are called state constraints. States satisfying all 
constraints of a domain shall be called acceptable. 

State constraints, as we have seen, give rise to additional effects if vi- 
olated after the direct effects of an action have been considered. Actions 
having indirect effects conflicts with the general assumption that nothing 
changes except what is mentioned in an action law. This can easily be seen 
with our example circuit: Let S = {-'up(s]^), -ilight} be the current state 
as depicted in Fig. 2.1. Performing a toggle(sj^) action in S produces the 
state S' = {up(sj^), -ilight} following the known action laws (1.2) and ac- 
cording to the definition of how to apply them. This is not the expected 
successor state, as is formally evident by its violating the underlying state 
constraint, light = up(sj^). Fortunately we were foresighted enough to call 
“preliminary” states like S' . The fact that often it does not suffice to com- 
pute, via the application of action laws, the mere direct effects is called the 
Ramification Problem.^ 

One straightforward ‘solution’ to the Ramification Problem is to circum- 
vent it. That is, one could stick to the assumption that action laws be com- 
plete in specifying the entire effect. To this end, all indirect effects must 
somehow be compiled into the action laws. This procedure, however, bears 
two major problems demonstrating its inadequacy. First, it may require an 
enormous number of action laws to account for every possible combination 
of indirect effects. To see why, consider a model of an electric circuit where a 
distinguished switch is involved in n sub-circuits each of which additionally 
contains a switch-bulb pair; see Fig. 2.2. Defining all effects of toggling the 
separate switch solely by means of action laws then requires 2”+^ differ- 
ent laws, one for each possible combination of truth-values assigned to the 



® The name shall suggest the picture that a state description literally ramifies 
into various directions, each following a possible chain of indirect effects. 
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-.up(so) 




flight 1 



-.light2 



up(s„) • 







Figure 2.2. An electric circnit consisting of a battery, a separate switch sq, and 
n sub-circuits each containing a switch Si and a light bnlb light^. Exponentially 
many cases have to be considered when specifying the effect of closing sq solely 
by means of action laws. 



switch being operated and the other n switches. An arbitrary example is the 
following: 

toggle(so) transforms {-■up(so), up(si), -.lights, -.up(s 2 ), -■light 2 , .. .} 
into {up(so),up(si),lighti,^up(s2),-'light2,. . .} 

The other laws look no less unpleasant than this one. This is clearly far from 
what we consider an adequate specification. Full adequacy is achieved only 
if the original, simple specification (1.2) of action toggle(a:) also applies 
to switch SQ in this more complex domain, and if the n state constraints 
lightj = up(so) Aup(si) do the rest. Notice that the size of this specification 
is linear (in n) as opposed to the exponential number of action laws. 

The second problem with exhaustive action laws containing the entire 
effect is that the introduction of a new state constraint may require, in the 
worst case, a redefinition of the entire set of action laws used before. Just 
suppose yet another switch-bulb pair up(s„+i), light,^_,_j^ be added to the 
circuit of Fig. 2.2. Then all of the previously carefully designed action laws 
for toggle(so) — and recall their exponential number — would need revision. 

It so seems we have to think about real solutions to the Ramification 
Problem. 



2.2 Minimizing Change 

We have seen that the basic issue with the Ramification Problem is to resolve 
the conflict between the possibility of indirect effects and the postulate of 
persistence, i.e., that a fluent does not change unless it is explicitly so stated 
in the respective action law. The challenge, therefore, is to find a suitably 
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weakened version of this postulate. The general difficulty with this is that 
one has to perform some sort of a tightrope walk. On the one hand, rigorous 
persistence of all non-affected fluents is still required; on the other hand, 
arbitrarily complex chains of indirect effects need to be accounted for. 

Let us try a first step forward on the tightrope. Indirect effects are to be 
considered whenever the application of an action law results in a state which 
violates one or more state constraints. Obviously, such a state perfectly ac- 
counts for persistence — but not at all for indirect effects. On the other hand, 
notice that the successor state we look for, that is, which accounts for all 
indirect effects, is obviously to be found among the acceptable states, i.e., 
those which satisfy all constraints. In the worst case, however, an arbitrary 
one of these states does not at all account for persistence. But now suppose 
we combine the one extreme with the other. Namely, we select among all 
acceptable states the one that shares the most fluent literals with the prelim- 
inary successor state. ^ Then we have accounted both for indirect effects (since 
no constraint is violated) and for persistence to the largest possible extent 
(since no more fluents have changed than absolutely necessary to obtain an 
acceptable state). In other words, and from a more constructive perspective, 
the idea is to take the preliminary successor and to change the truth-values 
of as few as possible fluents — of course without touching one of the direct 
effects — until a state results that satisfies the state constraints. 

To see how this approach works, recall the circuit of Fig. 2.1. We have 
seen that toggling the switch in the current state, viz. {-iup(s]^), -ilight}, 
yields the preliminary successor S' = {up(s;[), -ilight}, which violates the 
underlying state constraint, light = up(sj^). The state closest to S' in 
satisfying this formula and in still containing the direct effect, up(sj^), is 
T = {up(s;l^), light}. This is indeed the intended and intuitively expected 
successor state: The light went on as indirect effect of closing the switch. 

The following formal account of this approach is based on a comparative 
notion of distance between states. We introduce the concept of minimizing- 
change successors, which are obtained according to the above description. 

Definition 2.2.1. Let V be a basic action domain. If S,T,T' are states, 
then T is closer to S than T' , written T T' , iff T \ S ^ T' \ S . Let 
C be a set of state constraints, S a state which is acceptable (wrt. C), and 
a an action. A state T is minimizing-change successor of S and a iff the 
following holds: There exists a preliminary successor S' of S and action a 
obtained through direct effect E such that 

1. ECT, 

2. T is acceptable, and 

3. there is no T' -<s’ T such that ECT' and T' is acceptable. ■ 



^ Actually this state ‘closest’ to the preliminary successor need not be unique, 
see below. 
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The condition T\S <^T'\S for T states that S and T differ in 

strictly less fluent literals than S and T' do. The definition then says that a 
minimizing-change successor is a state which contains the direct effect of the 
action in question (clause 1), which satisfies the state constraints (clause 2), 
and which is closest to a preliminary successor S' in so doing (clause 3). 

Example 2.2.1. Let T> be the domain consisting of entity S;]^, fluent names 
up^ and light®, action name toggle^, and the familiar action laws 

toggle(x) transforms {up(x)} into {-'up(x)} 

toggle(x) transforms {->up(a:)} into {up(a;)} 

Along with state constraint C = {light = up(s]^)}, this formalizes our 
circuit of Fig. 2.1. 

Let the current state be the acceptable {-■up(s]^), -ilight}. The only pre- 
liminary successor state of S and toggle(s]^) is 5" = (up(sj^), -ilight}, ob- 
tained through direct effect E = |up(s;L)}. Now, there is just one acceptable 
state containing E, viz. |up(s j^), light}. Being the only candidate, this state 
is of course the one closest to S' , hence it is the unique minimizing-change 
successor when performing toggle(s]^) in the state |-'up(s]^), -ilight}. ■ 

Our first approach to the Ramification Problem works fine with our small 
example domain. The interested reader may verify, for instance, that if in the 
successor state obtained above we toggle the switch again, then not only does 
it take its original position but also the light is off in the resulting minimizing- 
change successor. Or, suppose that in the current state depicted in Fig. 2.1 
we perform a non-deterministic action to the effect that the switch may or 
may not get closed, then there are two minimizing-change successor states: 
Either the switch is up and light is on, or else the position of the switch has 
not changed and the light stays off. 

For a general assessment of the approach of minimizing change, this ob- 
servation is crucial: All fluents of a state constraint which are not among the 
direct effects have equal right to change their value in case this constraint is 
violated by the preliminary successor at hand. This appears to be no prob- 
lem as long as only two fluents are involved in a constraint, one of which 
must have changed so that this constraint became violated.® As soon as a 
state constraint relates three or more fluents, however, it may be erroneous 
to consider equally possible all state adaptations that correct a violation. To 
illustrate this, we enhance our electric circuit by a second switch as shown in 
Fig. 2.3. This — at first glance innocent — modification has a surprising effect 
on the applicability of our first approach to the Ramification Problem. 

Example 2.2.2. Let T> be the domain consisting of entities S;]^ and S 2 , fluent 
names up^ and light®, action name toggle^, and the following action laws. 

® Surprisingly enough, even in this simple case the approach may lead to the 

wrong conclusion, e.g., if applied to a scenario discussed later, in Section 2.8. 




22 



2. The Ramification Problem 



^up(si) up(s2) 



M flight 



Figure 2.3. The electric circuit of Fig. 2.1 enhanced by a second switch. What 
would you expect as to the outcome of closing in the current state? 



toggle(a;) transforms {up(a;)} into {-■up(a;)} 

toggle(x) transforms {-■up(a;)} into {up(x)} 

The two switches and the light bulb being serially connected, we introduce 
the state constraint light = up(sj^) A up(s2). 

Let the current state be the acceptable {^up(s]^), up(s2), -■light} as 
depicted in Fig. 2 . 3 . Now suppose toggle(sj^) be performed in this state, 
then the unique preliminary successor is S' = {up(s]^), up(s2), -■light}, 
which is obtained through the direct effect E = {up(sj^)}. This state violates 
the state constraint. There are two acceptable states containing E, namely, 
Ti = {up(si),up(s2), light} and T2 = {up(si), -■up(s2), -■light}. As for 
the respective distance to S', we observe that Ti\ S' = {light} and T2 \ 
S' = (-■up(s2)}. Thus neither Ti ^5/ T2 nor T2 -<s' T\. Consequently, 
both Ti and T2 are minimizing-change successors. ■ 

So instead of coming to the obvious conclusion that the light must turn on as 
a side effect of toggling an equally possible course of events is suggested, 
namely, that switch S2 leaves its position and the light stays off! 

What is the reason for this unexpected outcome? A closer examination of 
the underlying state constraint reveals it. From light = up(sj^) Aup(s2) we 
can deduce that up(s]^) D light V ^up(s2) but not the stronger implication 
up(sj^) D light (nor, of course, up(sj^) D -■up(s2)). In words, suppose we 
know that switch is in the upper position, then it follows that light is 
on or switch S2 is down. It does not necessarily follow that the light is on. 
Therefore, if the state constraint becomes violated by up(s]^) becoming true, 
the mere disjunction light V -■up(s2) needs to be satisfied. Obviously, this 
can be achieved either by changing -■light to light or, voila, by changing 
up(s2) to -■up(s2). The pure state constraint does not allow to distinguish 
between these two possibilities in view of preferring the former, as one would 
like. Hence the two minimizing-change successor states. 
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There is a general principle behind the problem we have encountered 
with this example. Often a mere state constraint does not contain sufficient 
information as to what effects it is expected to trigger. From the perspective 
of logic, all fluent changes that correct a violation of a constraint are equal. 
Some of these changes, however, would never occur in reality as indirect 
effect. Consequently, additional domain knowledge is required that allows to 
distinguish the deductions which correspond to real effects from the mere 
logical consequences which have no equivalent in reality. This is a challenge 
of very general nature, which we will have to face at various places throughout 
the book. Our objective, therefore, is to develop a solution on the basis of 
a principle (yet to be found) which is as universal as the problem to be 
addressed. 



2.3 Categorizing Fluents 

At the end of the previous section we arrived at the insight that in general a 
mere state constraint is insufficient for a correct handling of indirect effects. 
This raises the issue of the nature of additionally required information. With 
regard to our example domain, it would of course be sufficient to add the very 
specific knowledge that “when closing the switch, an activation of the light 
bulb is likely to occur, as opposed to the other switch changing its position.” 
Yet our aim is to find a general criterion that helps us telling apart the correct 
indirect effects. That is to say, we seek the reason for us expecting the light 
turns on rather than that the switch moves. 

In fact, one essential difference between the light bulb and the switch is 
fairly apparent. Namely, the state of a switch can be affected only by direct 
operation. Its state is, in a certain sense, independent of the values of other 
fluents. In contrast, there are no means to directly operate the light bulb. 
Rather its state completely depends on the values of other fluents. From 
this perspective, it is no surprise that toggling a switch may indirectly affect 
the light but not the other switch. This observation suggests the following 
refinement of our first approach to the Ramification Problem. 

In order that only expected indirect effects are generated, the fluents of a 
domain are divided into two categories. One of which consists of those fluents 
that represent state components which are manipulated by direct operation. 
The other category contains all fluents that represent state components which 
depend on other components. Let us call primary fluents of the former kind 
and secondary those of the latter. What fluent belongs to which category 
cannot, of course, be gathered from the state constraints. Therefore, the 
categorization needs to be given as part of the domain specification. On the 
basis of this additional information, whenever the procedure of minimizing 
change offers a choice between a primary and a secondary fluent, then the 
former is preferred to remain unchanged — hence the latter is preferred to 
adapt as indirect effect. 
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A formalization of this approach requires just a marginal refinement of 
our concept of minimizing-change successors (recall Definition 2.2.1). Namely, 
the notion of distance shall now respect the preference for minimizing change 
of primary fluents. Otherwise our definition remains unchanged. 

Definition 2.3.1. Let T> = A, £) he a basic action domain. Further- 
more, let Tp (primary fluents) and Tg (secondary fluents) be two sets of 
fluents such that tFpfMFg = {} and TpUiFs is the set of all fluents composed 
of fluent names T and entities £ . If S, T, T' are states, then T is closer 
to S than T' wrt. Tp,Tg, written T <s T' iff 

1 . ||r\5||nj),£ ||T'\5||n.Fp, or 

||r\5||n.Fp = ||r\5||n.Fj, and ||t\ a|| £ ||r\5|| n.^,. 

Let C he a set of state constraints, S a state which is acceptable (wrt. C ), 
and a an action. A state T is categorized minimizing-change successor of S 
and a iff the following holds: There exists a preliminary successor S' of S 
and action a obtained through direct effect E such that 

1. E CT, 

2. T is acceptable, and 

3. there is no T' ^s’ such that E CT' and T' is acceptable, m 

With the refined notion of closeness of states we first concentrate on primary 
fluents (clause 1) when comparing distances. Only in case the distances are 
equal with this regard, secondary fluents become the decisive factor (clause 2). 

Example 2.3.1. Let domain V be as in Example 2.2.2 (recall the circuit of 
Fig. 2.3, with two switches). Suppose that Ep = {up(cc) : x G {si,S 2 }} 
and Eg = {light} be the fluent categorization for T>. When perform- 
ing toggle(sj^) in the acceptable state S = |-'up(s]^), up(s 2 ), “'light}, 
the (unique) preliminary successor is S' = |up(s;L), up(s 2 ), “'light}, which 
results through direct effect E={up(s;l^)} as before. There exist two ac- 
ceptable states containing E, namely, Ti = jup(s]^), up(s 2 ), light} and 
T 2 = |up(si),“'up(s 2 ), “'light}. From ||Ti \ S"|| (1 Ep = {light} nJp = {} 
and from \\T 2 \ S"|| C\ Ep = {up(s 2 )} (1 Ep = {up(s 2 )} we can conclude that 
Ti ^s' T 2 (but not vice versa, of course). Hence Ti is the unique 

categorized minimizing-change successor of {“'up(sj^), up(s 2 ), “'light} and 
toggle(si). ■ 

The distinction between primary and secondary fluents allows to recog- 
nize ‘phantom’ effects whenever these are obtained by changing a generally 
independent property where changing a dependent one would suffice to satisfy 
a state constraint. In this way we have avoided, for instance, the conclusion 
that a switch magically leaves its position where it was sufficient to turn on 
the light. On the other hand, it is obvious that this approach to the Ramifica- 
tion Problem relies on the existence of a suitable categorization for a domain. 
The following example with a newly extended electric circuit shows that this 
is not guaranteed. 
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-.up(si) up(s2) 




-.light 



Figure 2.4. Yet another enhancement of our electric circuit: It now includes a 
third switch, S 3 , and a relay, whose state is represented by the nullary fluent 
relay. When activated, the relay forces switch S 2 downwards. 



Example 2.3.2. Consider the electric circuit depicted in Fig. 2.4. Let this be 
modeled by domain T> consisting of entities S;]^, S2, and S3; fluent names 
up^, light°, and relay^; action name toggle^; and the action laws 

toggle(a;) transforms {up(x)} into {-.up(x)} 

toggle(a;) transforms {-.up(x)} into {up(x)} 

Let C be the three state constraints 

light = up(s]^) A up(s 2 ) 

relay = -'up(si) A up(s3) ( 2 . 1 ) 

relay D ^up(s2) 

The last formula states that the activated relay attracts switch S2- Let the 
current state he S = {-.up(s;L), up(s2), ~'up(s3), -.light, -.relay}, as shown 
in Fig. 2.4. 

Now, for a suitable account of possible indirect effects of toggling switches 
in this circuit we are supposed to categorize the involved fluents. This is 
straightforward in case of up(s]^) and up(s3) (being primary) and light 
and relay (being secondary); it is easy to verify that any other choice would 
immediately cause undesired effects. But what about the second switch, rep- 
resented by up(s2)? On the one hand, we naturally tend to consider it pri- 
mary as before. For if is toggled in the current state, then S2 is sup- 
posed not to change where light is expected to do so. On the other hand, 
there is reason to consider up(s2) secondary. Namely, if we close switch S3 
in the current state, then the relay gets activated and, hence, is expected 
to cause switch S2 leave its position. Formal examination reveals that this 
would not be the only categorized minimizing-change successor in this sce- 
nario had we Ep = |up(s]^), up(s2), up(s3)| and Es = {light, relay}: The 
unique preliminary successor of state S from above and action toggle(s3) 
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is S' = {->up(s up(s2), up(s3), -ilight, ^relay}, obtained through direct 

effect E = {up(s3)}. There are exactly three acceptable states containing E, 
viz. 

Ti = {^up(si),-.up(s2),up(s3), -.light, relay} 

T2 = {up(si),up(s2),up(s3), light, -.relay} 

T3 = {up(si),-.up(s2),up(s3), -.light, -.relay} 

To see why, observe first that both light and relay are completely deter- 
mined by the positions of the switches (c.f. formulas (2.1)). There are four 
different combinations of switch positions given that up(s3), one of which, 
however, is not acceptable, namely, where only is down. As for the re- 
spective distance to S', we have 

||Ti \ S"|| n = {up(s2), relay} n.7^p = {up(s2)} 

IIT2 \ S"|| n.7^p = {up(si), light} n.7^p = {up(si)} 

IIT3 \ S"|| n.7^p = {up(si),up(s2)} n.7^p = {up(si),up(s2)} 

It follows that both Ti and T2 -<s' T3 whereas 

Ti and T2 are not comparable wrt. S' . Hence the two are categorized 
minimizing-change successors of S and toggle(s3). ■ 

The existence of the unintended successor, i.e., T2 , where switch changes 
instead of switch S2 , can be explained by the necessity of changing a pri- 
mary fluent — aside from up(s3), which was the direct effect — to arrive at 
an acceptable state. In the intended successor, i.e., Ti, this change concerns 
switch S2, triggered by the activation of the relay. This very activation is 
avoided, however, by moving switch instead, as done in T2. In both ways 
we respect the idea of minimizing change of primary fluents. Hence the two 
successor states. 

The reason for this unexpected result is that we necessarily fail to assign 
a unique appropriate category to fluent up(s2), whose role is twofold: On the 
one hand, it should be considered primary regarding the sub-circuit involving 
switch and the light bulb, and on the other hand, it behaves like a 
secondary fluent as regards the relay. From a general perspective, this proves 
that it might be impossible to globally characterize a fluent either as always 
being ‘active’ or as always being ‘passive.’ The way a fluent behaves rather 
seems a more local property, depending on which of the related components 
one considers. In other words, a fluent can be active regarding one aspect 
and passive regarding another. As for our example, one might suggest just 
introducing an additional category of, say, tertiary fluents, Tt, with even 
lower priority than secondary fluents. Then taking up(s;[),up(s3) G Tp, 
up(s2), relay G Ts, and light G Tt yields the expected unique resulting 
state, provided an appropriate extension of our notion of state distance wrt. a 
categorization (recall Definition 2.3.1). However, this particular classification 
requires a deeper analysis of possible direct and indirect effects in the electric 
circuit and seems, at first glance, quite unnatural. It is, moreover, not hard 
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to imagine more complex domains demanding more and more categories, 
which heavily increases the difficulty of deciding which class a particular 
fluent should belong to. In particular, the addition of constraints to a domain 
specification may require a modification of the original categorization (as in 
our example, where up(s 2 ) moved to Tg and light to Tt ) — which is to 
be considered inadequate. 



2.4 Causal Relationships 

The different approaches with which we attempted to tackle the Ramifica- 
tion Problem thus far mainly revealed that accounting for indirect effects 
of actions is more challenging an enterprise than it seems at first glance. 
Although the occurrence of indirect effects is triggered by the necessity to 
correct violations of state constraints, the very constraints turned out not 
to suffice for recognizing logically proposed effects that would never occur in 
reality. Providing additional domain knowledge in form of global classifica- 
tion of fluents proved too coarse in general as well. Whether or not a fluent 
is expected to change as indirect effect is a more local property, depending 
on the context. What we need, therefore, is a concept for generating indirect 
effects on the basis of knowledge whose structure respects and reflects this 
context-sensitivity. The notion of causal relationships will serve this purpose. 
Being a key concept of this book, these relationships will be introduced and 
thoroughly analyzed in this and the following four sections. 



2.4.1 Causal Relationships and Their Application 

Causal relationships are formalizations of the circumstances under which the 
occurrence of a single indirect effect is to be expected. Two components 
constitute these circumstances. One of which describes the context required 
for the indirect effect. The context is represented by a fluent formula that 
needs to be satisfied in a state in order that the causal relationship applies. 
The second component is a particular single effect whose prior occurrence 
causes (hence the name) the indirect effect in question. This triggering effect 
itself may have been previously obtained as indirect effect, if it was not among 
the direct effects of the action. 

The distinction between a context and a particular triggering effect is 
essential. The reason for this will be elucidated in a moment. First, let us 
consider an example of a causal relationship, which occurs in the electric 
circuit with just two switches and the light bulb of Fig. 2.3. Suppose switch 
gets closed as a direct effect of some action. This effect is expected to cause 
the light go on as indirect effect, provided switch S 2 is closed, too. We write 
this causal relationship as 

up(sj^) causes light if up(s 2 ) (2.2) 




28 



2. The Ramification Problem 



Here up(s^) is the effect whose appearance triggers indirect effect light 
in the context defined by the atomic fluent formula up(s 2 ). Of course, the 
analogue is true as well in this particular domain, viz. 

up(s 2 ) causes light if up(s;[) (2-3) 

Yet this symmetry is not always true, as we will see later. In any case, the 
two causal relationships express different things: The former applies whenever 
up(s;[) became true with up(s 2 ) already being true, as opposed to the latter, 
which applies whenever up(s 2 ) became true while up(s]^) holds.® 

Causal relationships shall be employed to generate additional, indirect ef- 
fects of actions after having obtained the direct effects through the application 
of an action law. Formally, causal relationships operate on state-effect pairs 
(S,E) where S is some current ‘intermediate’ state, a preliminary successor, 
for instance, and E contains all direct and indirect effects that have been 
generated so far. As an example, recall that S' = {up(sj^), up(s 2 ), -■light} 
is the preliminary successor of toggling the first switch in the state depicted 
in Fig. 2.3. State S' is obtained through direct effect E = {up(s]^)|. The 
causal relationship (2.2) above is applicable to (S',E) on account of both 
up(sj^) being part of effect E and up ( 32 ), the context, being true in S' . 
This relationship implies that S' should be modified so as to account for the 
indirect effect light, which yields the state S" = {up(s;[), up(s 2 ), light}. 
In addition, we augment E by our new effect, light. Altogether the re- 
sult of applying the causal relationship to (S", {up(s]^)}) is the state-effect 
pair {S", {up(s;^), light}). Notice that our second causal relationship, (2.3), 
should not be applicable to (S',E) despite up(s]^) being true in S", because 
effect up(s 2 ) is not contained in E. 

The reason for maintaining the second component, E, is that identical 
intermediate states (such as S') can often be reached by different effects, each 
of which may require diverse, sometimes opposite treatment, as the following 
example illustrates. 

Example 2.4-1. Suppose two switches S;^, S 2 be tightly coupled by a spring 
so that they are always in the same position; see Fig. 2.5. This is reflected by 
the state constraint up(s;[) = up(s 2 ). As a consequence, closing or opening 
either switch has the indirect effect that the other switch closes or opens, 
respectively, as well to release the tension of the spring. This is expressed by 
these four causal relationships: 



up(si) 


causes 


up(s2) 


if 


T 


up(s2) 


causes 


up(si) 


if 


T 


^up(si) 


causes 


^up(s2) 


if 


T 


^up(s2) 


causes 


-■up(si) 


if 


T 



If both switches get closed as direct or indirect effects, then of course either 
relationship is applicabl and the result is the same, namely, light. 
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^up(si) 




Figure 2.5. Two switches are mechanically connected through a tight spring, whose 
tension does not allow the switches occupy different positions. 



Now, suppose we switch up the first switch in the state {->up(s;L), -iup(s2)}. 
This yields the preliminary successor S' = {up(s]^), -iup(s2)}. On the other 
hand, suppose we switch down the second switch in the state {up(s;[), up(s2)}. 
This obviously yields the very same preliminary successor S'. Nonetheless 
the expected outcomes in these two situations differ considerably: In the first 
case the final result should be that both and S2 are in the upper po- 
sition, as opposed to the second case where both are expected down! This 
distinction can only be made by referring to the differing direct effects, viz. 
El = {up(s]^)} as opposed to E2 = {-■up(s2)}. The former enables only 
the application of the very first relationship in (2.4) to the intermediate re- 
sult, the latter only the application of the very last one. The two resulting 
state-effect pairs are 

({up(si),up(s2)}, {up(si),up(s2)}) 

( {-up(si), ^up(s2)} , {-up(s2), -up(si)} ) 

Thus in either case we obtain the intended successor state. ■ 

Incidentally, this example also illustrates the necessity of distinguishing trig- 
gering effects from contexts in causal relationships: There is an obviously 
crucial difference between the situation where up(sj^) became true in the 
context -iup(s2) and the situation where -iup(s2) became true in the con- 
text up(s]^).^ 

Curious as the last example may seem at first glance, such tight cou- 
pling between fluents frequently occurs in an important class of state con- 
straints. So-called “definitional” constraints define, for the sake of conve- 
nience, a fluent as abbreviation for a complex fluent formula. The constraint 

^ Let us add the marginal note that dividing the condition for the occurrence of 
an indirect effect into two components matches the distinction often made in 
philosophical accounts of causality between so-called “triggering” and “predis- 
posal” causes. 
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Vx [down(a:) = -'up(a;)], for instance, defines down as not being up. Clearly, 
whenever an instance of up(x) is changed the respective instance of down(a;) 
is supposed to change accordingly — and vice versa. This brings about simi- 
lar situations as in our spring scenario, so that their correct treatment, too, 
relies on the distinction between triggering effect and context. In passing, let 
us mention that definitional state constraints contribute to another special 
problem, which will be discussed in detail in Section 2.7. 

Much like with action laws, it might be convenient to use variables in 
causal relationships in order to summarize a whole collection of similar ground 
instances. E.g., suppose all switches of an electric circuit and a light bulb are 
serially connected, then 

up(x) causes light if \/y.wp{y) 

states that having switched up any switch x causes light provided all switches 
are (now) in the upper position. With this we arrive at the following formal 
definition of causal relationships and their application to state-effect pairs. 

Definition 2.4.1. Let S and T be sets of entities and fluent names, re- 
spectively. A causal relationship is of the form 

e causes p if ^ 

where (the context) is a fluent formula and both e ('t/ie triggering effect j 
and g (the ra,mitica,tion) are fluent expressions.^ 

Let {S, E) be a pair consisting of a state S and a set of fluent liter- 
als E. Furthermore, let r = e causes p if be a causal relationship, and 

let X denote a sequence of all free variables occurring in s, g, or <F. Then a 
ground instance r[e] is applicable to (S,E) iff e\e] G E and ^[e] A-'p[e] is 
true in S. The application of r[e] to (S,E) yields the pair {S',E') where 
S" = (S' \ {-'p[e]}) U {p[e]} and E' = {E \ {-^ g\e\\) VJ {g\e\\ . Lf TZ. is a set of 
causal relationships, then by (S,E) {S',E') we denote the existence of 

a relationship in TZ whose application to (S,E) yields {S',E'). ■ 

To summarize, a causal relationship £ causes p if is applicable if con- 
text F holds, the indirect effect-to-be g is currently false, and if its cause e 
is among the current effects. A relationship is applied by changing -<g to g 
in the current state and adding g to the current effects. In order that the 
latter does not produce an inconsistency among the effects, a possibly pre- 
ceding effect —'g is withdrawn.® This precaution guarantees that whenever 
(S,E) {S',E') with S being a state and E being consistent, then 

S' is a state and E' is consistent, too. 

® Recall that fluent expressions are of the form {-<)f{ti, . . . , t„) with / being a 
fluent name and ti a variable or entity. 

® It may sound strange to presuppose the occurrence of such situations instead 
of disregarding them, or even excluding them for that matter. Yet later, in 
the following but one section, we will meet an example scenario where this 
withdrawing a previously obtained indirect effect is necessary and perfectly 
reasonable. 
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Thus far we have seen how applying a single causal relationship generates 
one particular indirect effect. The direct effects of an action may of course 
give rise to several indirect effects. Moreover, these effects may in turn cause 
further effects, possibly triggering even more and so on and so forth. Closing 
switch S 3 in our circuit involving the relay (Fig. 2.4), for example, causes an 
activation of that relay, which in turn triggers an indirect effect by attracting 
switch S 2 . These chains of indirect effects are modeled by serial application 
of causal relationships. 

Example 2.4-2. Let T> be the basic action domain formalizing the electric 
circuit of Fig. 2.4 as in Example 2.3.2. As for the various causal dependencies 
among the components, we notice, first, that light turns on if either of the 
two switches and S 2 is caused up with the other one already being 
in its upper position. Conversely, light is off if either of these switches is 
caused down, regardless of the other switch’s position. This is reflected by 
introducing the following four causal relationships: 

up(sj^) causes light if up(s 2 ) ^up(s]^) causes -ilight if T 

(^. 2.0 j 

up(s 2 ) causes light if up(sj^) ^up(s 2 ) causes -ilight if T 

Analogously, the state of the relay causally depends upon the two switches 
and S 3 as follows: 

-iup(sj^) causes relay if up(s 3 ) 
up(s3) causes relay if ->up(s]^) 

Finally, the activated relay forces switch S 2 be down, that is, 

relay causes -iup(s2) if T ( 2 . 7 ) 

Now, let S = {-iup(s]^), up(s2), -iup(s3), -ilight, -irelay} be the current 
state, as depicted in Fig. 2.4. Performing the action toggle(s 3 ) yields the 
following state-effect pair. 

( {^up(si), up(s2), up(s3), -ilight, -irelay} , {up(s3)} ) 

The one and only applicable causal relationship is the bottom left one in (2.6), 
which activates the relay: 

( {-'up(si), up(s2), up(s3), -ilight, relay} , |up(s3), relay} ) 

As a consequence of this indirect effect, the relationship of (2.7) is now ap- 
plicable, which results in 

( {^up(si), -iup(s2), up(s3), -ilight, relay} , |up(s3), relay, -iup(s2)} ) 

This state-effect pair allows no further application of causal relationships. 
Incidentally, its first component is acceptable wrt. the underlying state con- 
straints (c.f. formulas ( 2 . 1 )) and constitutes the (unique) resulting state ex- 
pected when we close the third switch in initial state S. m 



up(s;]^) causes -irelay if T 
-iup(s 3 ) causes -irelay if T 



( 2 . 6 ) 
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By now it should have become clear how causal relationships are used 
to address the Ramification Problem. Starting with some preliminary suc- 
cessor state, we (non-deterministically) select and (serially) apply these rela- 
tionships, each of which accommodates a single indirect effect. Provided the 
underlying set of causal relationships is designed so as to suitably represent 
real causal dependencies in a domain, all effects thus obtained are reasonable 
from the standpoint of causality. Whenever the application of a series finally 
produces an acceptable state, then the latter is taken as possible overall suc- 
cessor. Notice that each fluent which holds in the initial state and which is 
not touched at any step of this procedure remains unchanged. This approach 
therefore accounts both for rigorous persistence of non-affected parts on the 
one hand and for arbitrarily complex chains of indirect effects on the other 
hand. 

In what follows, we say that a sequence of causal relationships ri , . . . , r„ 
(n > 0) is applicable to a pair (5'o, i?o) iff we can find n pairs (Si,Ei ), . . . , 
(Sn,En) such that for each 1 < i < n, ri is applicable to {Si-i, Ei-i) yield- 
ing (Si,Ei). We adopt a standard notation in writing (S,E) {Sn, En) 
to indicate the existence of a (finite, possibly empty) sequence of causal rela- 
tionships in TZ which is applicable to (S,E) with final result (^n, if„). The 
following is a formal definition of the notion of successor states on the basis 
of causal relationships. 

Definition 2.4.2. Let £, T , A, and L besets of entities, fluent names, ac- 
tion names, and action laws, respectively. Furthermore, let C he a set of state 
constraints and TZ a set of causal relationships. If S is an acceptable state 
and a an action, then a state S' is a causal successor of S and a iff the 
following holds: Set C contains an applicable instance a transforms C into E 
of an action law such that 

1. {{S\C)\JE,E) Ain {S',E') for some E' , and 

2. S' is acceptable. ■ 

For a subtle reason to be revealed in Section 2.7 we are not yet completely 
satisfied with this definition as a general solution to the Ramification Prob- 
lem. This is why we postpone the adaptation of the notions of interpretations 
and models to this later section. 

The way causal relationships are employed makes no presuppositions as 
to the order in which they are applied. The two relationships used in our 
relay example to account for the indirect effects of closing the third switch 
can, however, be applied in only one order. For the chronologically second 
requires the result of the first, viz. relay activation, as the triggering effect. 
On the other hand, in general there may exist different ways of sequentially 
applying causal relationships. The formalization of our example circuit with 
n parallel sub-circuits each consisting of a switch-bulb pair (recall Fig. 2.2) 
shall illustrate this. 

Example 2.4.3. Let (n > 2) be the basic action domain consisting of 
entities sq, S;^, . . . , s„, fluent names up^, light]^°, . . . , light„°, action name 
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toggle^, and action laws 

toggle(a;) transforms {up(x)} into {-'up(x)} 

toggle(a;) transforms {-'up(a;)} into {up(a;)} 

Suppose further given the n state constraints light^ = up(so) A up(si) 
(1 < i < n) and, for each i = 1, . . . ,n, these four causal relationships: 

up(so) causes light^ if up(si) ^up(so) causes ^light^ if T 

up(si) causes light^ if up(so) -iup(si) causes ^light^ if T 

(2.8) 

(Notice the size of this specification, which is linear in n as opposed to 
the exponential number of action laws needed when formalizing this do- 
main without the concept of indirect effects.) Now, suppose all switches 
are closed except for sq and, hence, all light bulbs are off. That is, let 
S = {^up(so),up(s;L), . . . ,up(s„), -ilighti, . . . , -ilight^} be the current, 
acceptable state. Performing action toggle(so) in this state yields the pre- 
liminary successor S' = {up(sq), up(s;L), . . . ,up(s„), -ilight;^, . . . , -ilight^} 
along with the direct effect E = {up(so)}. Apparently, the state-effect pair 
(S',E) allows the application of all n causal relationships of the form 
up(so) causes light^ if up(si) where i = Whichever of these is 

executed first, the other n — 1 relationships remain applicable in the result- 
ing state-effect pair, and so forth. Thus there exist n ! different sequences, 
all of which obviously result in the same causal successor state, namely, 
{up(so),up(s]^), . . . ,up(s„), light;L, . . . , light„}, i.e., where all light bulbs 
are now active. ■ 

All application sequences coming to the identical result in this example 
raises the question whether order irrelevance holds in general. The following 
proposition states that indeed any permutation of a sequence of causal rela- 
tionships arrives at the same conclusion, provided the permuted sequence is 
applicable. 

Proposition 2.4.1. Let S and T he sets of entities and fluent names, 
respectively, Sq be a state, and Eg a set of fluent literals. Furthermore, let 
ri,...,r„ he a sequence of causal relationships (n > 0) which is applicable 
to {Sq,Eq) and which yields 

(So,Eq) '^{n} {Si, El) ^{ra} («5'n,A„) 

Then, for any permutation r^(i),. • ., r^(„) which is also applicable to {So, Eq) 
and which yields 

(S'o,Ao) {S'i,E'i) ... 

we have S'„ = S{, and E„ = 



V(n)} {S'n,K) 
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Proof. Let / be an arbitrary but fixed fluent. By kj we denote the number 
(possibly zero) of relationships = £j causes f if <Pi, and by kj the number 
(possibly zero, too) of relationships rj = Sj causes -■/ if <Pj (l < i,j < n). 
Since a causal relationship Si causes / if can only be applied to some 
{Si-i, Ei_i) if - 1 / € Si-i (and vice versa in case of indirect effect -i/), the 
values for k~^ and kJ determine the final truth-value of / as follows: 

1. If f G S, then either k~^ = kJ or k~^ = kJ — 1. In the former case 
we have / € S'„. We also have / G if kJ >0, otherwise / ^ En 
and -i/ ^ En since no causal relationship affects /. In the latter case 
we have both -if G Sn and -if G En- 

2. If -if G S, then either k^ = kJ or k~^ = kJ + 1. In the former case we 
have -if G Sn- We also have -if G E„ if kJ > 0, otherwise -if ^ En 
and / ^ En since no causal relationship affects /. In the latter case we 
have both f G Sn and f G En- 

Since the permutation £,^( 1)1 • ■ • >?' 7 r(n) contains exactly the same causal re- 
lationships as the original sequence, they do not differ in the values for k~^ 
and kJ . Thus Sn and S'n agree as far as / is concerned, and so do En 
and E'n- Fluent / being an arbitrary choice proves the claim. Qed. 

While this result proves general invariance with regard to applicable per- 
mutations of causal relationship sequences, it does not imply confluence of 
the whole procedure in general. In fact, a different selection at the beginning 
may allow for the application of a completely different collection of causal 
relationships. If two chains of relationships do not contain identical elements, 
then neither is a permutation of the other and our Proposition 2.4.1 does 
not apply. This is, however, not at all a drawback or even fallacy, as one 
might guess at first. Rather it allows to accommodate actions which are de- 
terministic as far as their direct effects are concerned but non-deterministic 
as regards the indirect effects they possibly trigger. That is, even if a unique 
preliminary successor exists, performing an action in some state may ad- 
mit more than one possible causal successor state. An example for an action 
that is non-deterministic in this sense occurs in a domain to be presented in 
Section 2.6. 

The opposite to the existence of multiple successor states is that no suc- 
cessor at all can be found despite one or more action laws are applicable to the 
state at hand. That is, no chain of causal relationships manages to transform 
a preliminary successor into an acceptable state. This hints at additional, 
implicit preconditions for the action in question — preconditions which derive 
from state constraints. Section 2.8 will be devoted to this phenomenon. In 
the following section, we first raise another central issue, namely, how causal 
relationships and the underlying state constraints are related — in particular, 
we seek a way to automatically extract the former from the latter. 
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2.5 Influence Information 

The causality-based approach to the Ramification Problem relies, to state 
the obvious, on the underlying set of causal relationships being suitable for 
the domain at hand. This set should be sound in that each element repre- 
sents an intuitively plausible causal relation, and it should also be complete 
in covering all conceivable indirect effects that derive from the given state 
constraints. The various causal relationships we used in our relay example 
(c.f. (2.5)-(2.7)), for instance, arguably constitute a suitable collection for this 
domain. Apparently there is a tight correspondence between these nine rela- 
tionships and the three underlying state constraints (c.f. formulas (2.1)). For 
example, causal relationship up(s 3 ) causes relay if -iup(s;j^) derives from 
state constraint relay = -■up(s]^) A up(s 3 ), which entails the implication 
->up(s;l) a up(s 3 ) D relay. Generally we have that for any e causes ^3 if ^ 
of our nine relationships, the corresponding fluent formula <P Ae Z) g can be 
deduced from one of the state constraints. 

This observation suggests that causal relationships might be somehow au- 
tomatically extractible from a set of state constraints. On the other hand, 
most of our discussion prior to introducing causal relationships centered 
around the problem that state constraints contain insufficient information 
to decide what are the indirect effects that can possibly occur in real- 
ity. To restate the problem, notice that our example constraint relay = 
-■up(sj^) A up(s 3 ) also entails implication -relay A up(s 3 ) D up(s;]^). The 
corresponding causal relationship up(s 3 ) causes up(sj^) if -relay , however, 
does not hold. There is an information gap between the mere state constraints 
and the valid causal relationships they determine. In the present section, we 
concern ourselves with filling this gap without the necessity of drawing up, 
all by hand, the set of causal relationships. That is, we seek a general method 
that allows a more compact providing of the additionally required informa- 
tion. 

In the previous but one section we discussed the range of applicability 
of tackling the Ramification Problem by categorizing fluents as either pri- 
mary or secondary. Our analysis showed that even in simple domains it can 
be impossible to globally characterize a fluent either as one which changes 
independently of all other fluents or as one which is generally manipulable 
by others. It turned out to be a local property whether or not a fluent is 
expected to change as indirect effect. In our example circuit involving the 
relay, for instance, the second switch is expected independent of switch 
but may certainly be affected by the relay. In other words, fluent up(s]^) 
has no direct influence on fluent up(s 2 ) , as opposed to fluent relay. Domain 
knowledge stating whether a particular fluent may or may not directly affect 
another particular fluent should therefore help telling apart the correct causal 

Actually it is more precise to say the second switch does not directly depend 

on switch Indirectly it does so, for the latter controls the relay, which in 

turn may control the second switch. 




36 



2. The Ramification Problem 



relationships. Formally, this so-called influence information is specified as a 
binary relation on the set of fluents. 

Definition 2.5.1. Let S and T be sets of entities and fluent names, respec- 
tively. An irreflexive binary relation I on the set of fluents is called influence 
information. ■ 

If (/i,/2) G I, then this is intended to denote that a change of fluent /I’s 
truth- value potentially affects the truth- value of fluent /2. For convenience, a 
pair in I may contain variables, thus representing all of its ground instances. 

Example 2.5.1. Consider the entities S = {sj^, S2, S3} along with the fluents 
.7^ = {up^, light^, relay^j. The two switches and S2 may influence the 
light but not vice versa nor do they mutually interfere (c.f. Fig. 2.4). Likewise, 
S;]^ and S3 may influence the relay, which in turn potentially affects the 
position of S2. Thus the correct influence information I is 

{(up(si), light), (up(s2), light), (up(si), relay), (up(s3), relay), , . 

(relay, up(s2))} 

Notice how the twofold nature of up(s2) is reflected in this influence 
information: The fluent occurs both as first (the active) component in 
the pair (up(s2, light) and as second (the passive) component in the 
pair (relay, up(s2)) . This is why it was impossible to globally categorize 
this fluent as either primary or secondary. ■ 

Our two switches which are connected by a spring (recall Fig. 2.5) show 
that influence information may induce cycles and even be symmetrical. 

Example 2.5.2. Let E = {si,S2| and T = {up^j. The two switches being 
mutually dependent, the correct influence information X is the following. 

{(up(si),up(s2)),(up(s2),up(si))| ■ 

Given correct information of potential influence, causal relationships are 
extractible from state constraints as follows. For the sake of clarity, let us 
first focus on quantifier-free, i.e., propositional fluent formulas. The general 
idea is investigating all potential ramifications and excluding those which 
do not respect the notion of influence. To this end, we consider all possible 
violations of a state constraint and formulate suitable causal relationships 
which help ‘correct’ this. Let us get more precise. Suppose C is a state 
constraint. First we construct the minimal conjunctive normal form (CNF, for 
short)^^ of this formula. Obviously, C is violated if and only if some conjunct 

A CNF of a (non-tautological, non-contradictory) formula C is some logical 
equivalent of the form Gi A . . . A G„ (n > 1) such that each Ci is of the form 
V . . . V iimi {mi > 1) with each iij being a fluent literal. The minimal 
CNF is a CNF such that for each conjunct Ci = In \/ ... W limi there is no 
strict subset L ’f. {in V ... V iirrn } such that C \= The minimal CNF 

is unique modulo associativity, commutativity, and idempotency of A and V. 
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input C : state constraint; 

I : influence information; 

output TZ : set of causal relationships; 

begin 

let TZ — {}; 

let Cl A ... A Cn be the minimal CNF of C ; 
for all i = 1 to n do 

let fi V ... V = Ci ; 
for all j — 1 to m do 

for all k — 1 to m, k j do 
if (llffll. pfcll) £2: then 

let 77. = 77. U { -ifj causes £k if 

end-for 
end 

Figure 2.6. The automatic extraction of causal relationships from state constraints 
on the basis of influence information. 

V . . . V of its CNF is violated. This in turn means that -ifi A ... A 
holds. Since the initial state supposedly has satisfied constraint C, the reason 
for -i£i A ... A -'im being true must be the occurrence of some (direct or 
indirect) effect G {“'fi, ■ ■ . , This violation of C can be ‘corrected’ 

by changing some other literal of this collection to via a causal 

relationship — but only in case fluent \\£j\\ potentially affects fluent ||ffc|| 
according to I. This way of generating causal relationships is formalized in 
the algorithm depicted in Fig. 2.6. 

Example 2.5.3. Consider the entities £ = {s]^, S 2 , S 3 } along with the fluents 
T = {up^, light®, relay®} and influence information X = (2.9). Further- 
more, let C be the familiar state constraints 

light = up(sj^) A up(s 2 ) 
relay = ^up(s^) A up(s 3 ) 
relay D -iup(s 2 ) 

Our algorithm applied to the elements of C in conjunction with X traces as 
follows: 

1. The minimal CNF of light = up(s]^) A up(s 2 ) is 

(->up(s]^) V -'up(s 2 ) V light) A (up(s j^) V -ilight) A (up(s 2 ) V -■light) 

Regarding conjunct C\ = -iup(s;L) V ->up(s 2 ) V light we obtain the 
following: 



A M 

I = 1, . . . , m 
I 3, I ^ k 
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- In case j = l,k = 2 we have (up(s]^), up(s2)) ^ X. 

- In case j = 1, A: = 3 we have (up(s]^), light) G X, which yields 

up(s]^) causes light if up(s2) 

- In case j = 2, A: = 1 we have (up(s2), up(s]^)) ^ I. 

- In case j = 2, A: = 3 we have (up(s2), light) G X, which yields 

up(s2) causes light if up(s;[) 

- In case j = 3,A: = I we have (light, up(s;L)) ^X. 

- In case j = 3, A: = 2 we have (light, up(s2)) ^ X. 

Regarding conjunct C 2 = up(s;]^) V -ilight we obtain the following: 

“ In case j = I, A: = 2 we have (up(s]^), light) G X, which yields 

-mp(s;l^) causes -ilight if T 

- In case j = 2,A: = 1 we have (light, up(s;[)) ^X. 

Regarding conjunct C3 = up(s2) V -ilight we obtain the following: 

- In case j = 1, A: = 2 we have (up(s2), light) G X, which yields 

-'up(s2) causes -ilight if T 

- In case j = 2, A: = I we have (light, up(s2)) ^ X. 

2. The minimal CNF of relay = -■up(s;]^) A up(s3) is 

(up(sj^) V -'up(s3) V relay) A (->up(s;L) V -■relay) A (up(s3) V ->relay) 

Regarding conjunct Ci = up(sj^) V ^up(s3) V relay we obtain the fol- 
lowing: 

- In case j = I, A: = 2 we have (up(s]^), up(s3)) ^ X. 

- In case j = I, A: = 3 we have (up(s]^), relay) G X, which yields 

->up(s;l^) causes relay if up(s3) 

- In case j = 2, A: = 1 we have (up(s3), up(s]^)) ^ I. 

- In case j = 2, A: = 3 we have (up(s3), relay) G X, which yields 

up(s3) causes relay if ->up(s]^) 

- In case j = 3,A: = 1 we have (relay, up(s;L)) ^X. 

- In case j = 3, A: = 2 we have (relay, up(s3)) ^ X. 

Regarding conjunct C 2 = -■up(s]^) V -irelay we obtain the following: 

- In case j = 1, A: = 2 we have (up(s]^), relay) G X, which yields 

up(s;l^) causes -irelay if T 

- In case j = 2,k = 1 we have (relay, up(s;L)) ^X. 

Regarding conjunct C3 = up(s3) V -irelay we obtain the following: 
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- In case j = 1, k = 2 we have (up(s 3 ), relay) G I, which yields 

->up(s 3 ) causes -relay if T 

- In case j = 2,k = 1 we have (relay, up(s 3 )) ^ X. 

3. The minimal CNF of relay D -iup(s 2 ) is 

-•relay V ^up(s 2 ) 

Regarding the only conjunct C\ = -irelay V -iup(s 2 ) we obtain the 
following: 

- In case j = l,k = 2 we have (relay, up(s 2 )) G X, which yields 

relay causes -■up(s 2 ) if T 

- In case j = 2,k = 1 we have (up(s 2 ), relay) ^ X. 

Altogether, the output is exactly the nine causal relationships which we have 
obtained in the preceding section by intuitive analysis of causal dependencies. 



As a small exercise, the reader may verify that the algorithm applied to the 
switches and spring-domain (Example 2.4.1 and Fig. 2.5) as well produces 
the correct outcome: The input consisting of the state constraint up(sj^) = 
up(s 2 ) and influence information X = {(up(s j^), up(s 2 )), (up(s 2 ), up(s]^))} 
results in the output of all four expected causal relationships, (2.4). 

The causal relationships e causes g if generated by our procedure all 
have a restricted syntax. Namely, context ^ is a conjunction of fluent literals 
instead of an arbitrarily complex fluent formula. This does not imply, how- 
ever, that some causal information otherwise being representable cannot be 
obtained through automatic generation. This is so, because any causal rela- 
tionship can be transformed into an operationally equivalent set of causal rela- 
tionships which obey the restriction. For suppose <?iV. . is a disjunctive 

normal form (DNF, for short) of some formula <F, then a causal relation- 
ship £T causes if iF and the collection e: causes p if , . . . , e causes p if 
are interchangeable. On the other hand, in certain cases exploiting full ex- 
pressiveness leads to considerably more compact representations. This can 
be accomplished by considering all automatically extracted causal relation- 
ships for a particular e and g, i.e., e causes £» if . . . , e causes g if 
and constructing formula 'f' as compact equivalent of V . . . V Then 
£: causes p if replaces the aforementioned n relationships. This first trans- 
forming state constraints into a normal form, then extracting causal rela- 
tionships, and finally retransforming the result into non-normal form may of 
course require extensive computation. In certain cases it might be much faster 
to employ more sophisticated means to straightly extract causal relationships 

On the analogy of CNF, a DNF of a formula is some logical equivalent of the 

form Di V . . . V D„ (u > 1) such that each Di is of the form A . . . A 

(mi >1) with each iij being a fluent literal. 
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from state constraints, i.e., by avoiding the roundabout way of constructing 
normal forms. On the other hand, the generation of causal relationships is 
a pre-processing step which is carried out only once for a fixed set of state 
constraints. This computational effort is therefore of minor importance. 

Talking about complexity, a related crucial issue concerns the number of 
causal relationships generated from a set of state constraints. This number 
is, in the worst case, exponential as regards the size of the input. The rea- 
son is that formulas exist which admit only CNF’s of exponentially increased 
length. Moreover, up to quadratic many relationships exist for a single con- 
junct, namely, if all involved fluents have the potential to affect each other. 
Despite this negative result, fortunately there is a decisive characteristic due 
to which especially in large domains the number of relationships is small 
compared to the worst case: The state constraints do not interfere when de- 
termining causal relationships. In general, large domains tend to be locally 
structured in that each single state constraint relates only a small fraction 
of the entire set of fluents. More precisely, let k be the maximum size of a 
state constraints^ and n their overall number. Then the number of required 
causal relationships is of magnitude 0(2^ • n). Granted that k remains con- 
stant with increasing domain size, the number of causal relationships thus 
is linear in the number of state constraints. E.g., in our example involving 
n sub-circuits (recall Fig. 2.2), each of the n state constraints relates only 
three fluents. Consequently, as we have seen (c.f. (2.8)) they give rise to a 
linear number of causal relationships (4 • n, to be precise). Incidentally, the 
fact that state constraints do not interfere in determining causal relationships 
solves the second problem mentioned in the introduction to the Ramification 
Problem. No existing causal relationship requires modification or needs to be 
removed if state constraints are added. Introducing a new switch-bulb pair 
up(s„+i), light„^]^ in the example just mentioned, for instance, amounts to 
adding four new causal relationships but requires no further changes. 

Thus far we have confined ourselves to the extraction of causal relation- 
ships from quantifier-free state constraints. As for the general case, notice 
first that since we assume finiteness of sets of entities f, it is always possible 
to rewrite any state constraint so as to become quantifier- free. Namely, each 
sub-formula \/x.F is replaced by f\e^£ F {x e} , and each sub-formula 
3a;. F is replaced by VeG£ Awkward as it is, this is unavoidable 

when dealing with unrestricted state constraints and influence information. 
To see why, consider some arbitrary formula like \/x3y\/z. f{x, y, z) as state 
constraint. Which causal relationships are determined by this constraints de- 
pends on how the various possible instances /(ei, 62 , 63 ) interact. Each of 
the instances may behave differently with this respect. In general, therefore, 
there is no better way than grounding a state constraint and proceeding as 
above. On the other hand, it is possible to exploit the expressiveness of causal 
relationships having variables in case a state constraint and influence infor- 
mation obey well-defined restrictions. In the following, we discuss two such 

As the size of a formula we take the number of literal occurrences. 
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classes which we consider of general importance so that a lot is gained by 
their special treatment. 

First, consider state constraints of the form Vxi...Va;„. F (or Wx-F, 
for short) with sub-formula F being quantifier-free. Suppose further that 
for any atomic fluent expression f[x] occurring in F, the underlying in- 
fluence information does not differ regarding different instances /[e]. Then 
the causal relationships determined by this constraint and influence in- 
formation can be obtained by applying our procedure to sub-formula F 
with all occurrences of variables taken as (distinct) entities.^"* As a sim- 
ple example, consider state constraint Va:[down(x) = -'up(x)] and influ- 
ence information I = {(up(a;), down(x)), (down(x), up(x))}. Four causal re- 
lationships are extractible from the corresponding quantifier- free constraint, 
down(a:) = -'up(a;), viz. 



down(x) 


causes 


^up(a;) 


if 


T 


'down(x) 


causes 


up(a;) 


if 


T 


up(x) 


causes 


-idown(a;) 


if 


T 


->up(a;) 


causes 


down(a;) 


if 


T 



stating that any entity becoming up (or not up) also becomes not down 
(or down, respectively) and vice versa. 

Second, and more sophisticated, we analyze state constraints in which 
each occurrence of a quantifier is of the form Vir. f[x] or of the form 3a;. £[5;] 
where ^[a;] is a (possibly negated) fluent expression with free variables x. 
Furthermore, it is assumed that, for any such i\x], the underlying influence 
information does not differ regarding different instances ||f^[e]||. When com- 
puting the minimal CNF of a state constraint which is restricted in this way, 
sub-formulas Va;. ^\x] and 3a;. ^\x] are treated as if they were ordinary liter- 
als. If in addition any Fix.i\x] is replaced by 3x.->£\x] and any ~^3x.£\x] 
by Va;. -i£[a;], then each conjunct in the resulting CNF is a disjunction con- 
sisting of ground literals and expressions of the form Vx. £[a;] and 3x.(.\x]. 
When generating causal relationships, the sub-formulas with quantifier are 
treated as follows: The reason for a formula Va;. ^\x] becoming false must be 
the occurrence of any effect Thus ^i\x] itself may be used, if influ- 

ence permits, as triggering effect in a causal relationship. The reason for a 
formula 3a;. ^\x] becoming false must be the occurrence of any effect ~'i\e\ so 
that (now) Vx. -^^\x] holds. Thus -^(i\x] may be used as triggering effect in a 
causal relationship whose context includes Va;. -i^[a;] . Correcting the violation 
of a state constraint by satisfying a formula 'ix. is achieved by satisfying 
all instances of ^\x]. Thus f[x] may be used as indirect effect in a causal 
relationship. Correcting the violation of a state constraint by satisfying a for- 
mula 3a;. £[a;] is achieved by satisfying just one instance of t\x]. Thus (.\x] 

During this procedure, the underlying influence information should of course 

be assumed to treat these entities similar to all others as regards the aforemen- 
tioned atomic fluent expressions f[x] contained in F. 
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input C : state constraint; 

X : influence information; 

output TZ : set of causal relationships; 

begin 

let TZ — {}; 

let Cl A ... A Cn be the minimal CNF of C ; 
for all i = 1 to n do 

let Fi V ... V F™ = Ci ; 
for all j = 1 to m do 

for all k = 1 to m, k ^ j do 
let<P = /\ -.Fi ; 

I = 1, . . . , m 
I ^ j, I ^ k 

case Fj is iji 

case Fk is lk‘- let r = -•ij causes it if 

case Fk is Vxk- £k[xk]: let r = -itj causes ik\xk] if 

case Fk is 3xk.ik\xk\- 

let r = -ifj causes ik\xk] if VTfc. -'tk\xk] A F 
case Fj is Vxj.ij[xj]: 

case Fk is £k: let r = ~'tj\xj] causes £k if F\ 

case Fk is \fxk- tk\xk]‘- let r = -'£jlxj] causes £k[xk] if F; 

case Fk is 3xk-£k[xk]-- 

let r = ~'£j[xj] causes £k[xk] if Vxk- ~^£k\xk] A F 
case Fj is 3xj.£j[xj]: 

case Fk is £k‘. let r = ~'£j\xj] causes £k ff VTj. ~^£j\xj] A F\ 
case Fk is Vxk.£k[xk]'- 

let r = -'£j[xj] causes £k[xk] if Vaij. ~'£j[xj] A F; 
case Fk is 3xk.£k[xk]‘- 

let r = -'£j{xj] causes £k[xk] if VTj. ~'£j[xj] A \fxk- ~'£k[xk] A F 
if (l|fi[%]ll, Il4[®fc]||) €X then 
let TZ — TZ yj {r} 

end-for 

end-for 

end 

Figure 2.7. An extended procedure for the automatic extraction of causal rela- 
tionships. 



may be used as indirect effect in a causal relationship whose context includes 
VT. -<£\ x ] . On this basis, an extended algorithm for automatic generation of 
causal relationships is depicted in Fig. 2.7. 

Example 2.5.4- Consider state constraint C = light = Vx.up(x), ex- 
pressing the serial connection of all involved switches and a light bulb. Let 
I = {(up(x), light)}, then our extended algorithm traces as follows: 




2.6 Non-minimal Successor States 



43 



The minimal CNF of C is 

(3x. -'up(a;) V light) A (Vy. up(y) V -■light) 

1. Regarding conjunct Ci = 3x. -■up(a;) V light we obtain the following: 

- In case j = l,k = 2 we have (up(a;), light) G I, which yields 

up(a;) causes light if \/x. up(a;) 

- In case j = 2,k = I we have (light, up(x)) ^ I. 

2. Regarding conjunct C '2 = V?/.up(j/) V -ilight we obtain the following: 

- In case j = 1, k = 2 we have (up(j/), light) G I, which yields 

~'up{y) causes -ilight if T 

- In case j = 2,k = I we have (light, up(y)) ^ I. 

Thus closing either switch causes light provided all other switches are closed; 
and conversely, opening either switch causes the light be off afterwards re- 
gardless of other switches — as one would expect. ■ 

With this extension of our basic algorithm we conclude the discussion on 
how causal relationships may be automatically generated whenever suitable 
influence information can be provided. It is, however, not claimed that the 
latter is always possible with our basic notion of influence information. More 
sophisticated means to specify potential influence may be required in certain 
domains. E.g., a fluent having the potential to affect another fluent may 
depend on whether the former occurs affirmatively or negated. To reflect this, 
the concept of influence information needs to be extended so as to relate fluent 
literals and not just fluents. Further generalization may allow for restricting 
potential influence to circumstances expressed by arbitrary fluent formulas.^® 
Whichever sophistication is required depends on the domain at hand. The 
simpler the notion of potential influence — provided it suffices — , the more is 
gained by the automatic extraction of causal relationships compared to these 
drawing up all by hand. 



2.6 Non-minimal Successor States 

The attempts to solve the Ramification Problem which we have discussed 
prior to causal relationships were based on the idea of minimizing change to 
the largest reasonable extent. The employment of causal relationships does 



Notice that this is unnecessary in case the circumstances follow from the state 
constraints. Of course, fluent relay can affect up(s 2 ) only in case relay is 
true (in our circuit depicted in Fig. 2.4). However, the very state constraint 
relay 3 -iup(s 2 ) encodes this knowledge, which is why it needs not be inte- 
grated into the information of potential influence. 
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not, at least not a priori, respect a similar notion. Since the principle of min- 
imizing change is widely considered essential for the Ramification Problem in 
literature, the present section is devoted to an analysis of whether and how 
this principle and the successive application of causal relationships relate. 
It will turn out that causal successor states cover all states with minimal 
distance to some preliminary successors in satisfying the state constraints 
while respecting causality. Yet we will also show that the converse does not 
hold. That is, requiring minimality is proved failing to account for all possible 
causal successor states in some domains. This result challenges the common 
belief in the necessity of minimizing change when addressing the Ramification 
Problem. 

For a general solution to the Ramification Problem, as we have seen all 
along, mere state constraints contain insufficient information. A suitable min- 
imization strategy therefore requires additional domain knowledge. The pur- 
pose of this knowledge is to help telling apart derived implications which 
correspond to actual indirect effects from those which are just logical con- 
sequences. For example, up(s;[) A up(s 2 ) D light, derived from state con- 
straint light = up(sj^) A up(s 2 ), indicates a causally correct implication, 
whereas up(s;[) A -ilight D -■up(s 2 ), derived from the same constraint, 
does not. Though being logically equivalent, the two implications need to be 
distinguished when minimizing change while accounting for indirect effects. 
Causal relationships serve this purpose if taken as directed implications. Un- 
like classical material implications, these directed implications must not be 
applied in reverse direction. That is, while the two fluent formulas F A £i D £2 
and F A -'£2 A -^£1 are interchangeable, the semantics of the correspond- 
ing relationships £i causes £2 11 F and -^£2 causes -i£i if F shall differ 
also in their logical reading. This reading of causal relationships as directed 
implications shall be the basis for a formal definition of minimizing change 
with regard to causal knowledge. This first of all requires a notion of how 
to employ causal relationships as deduction rules. Similar to the operational 
reading of causal relationships, this requires to distinguish between a con- 
text and a set of actually occurred effects. For the purpose of deduction, the 
former is given as a set of arbitrary fluent formulas F while the latter is, 
as before, a set of fluent literals, 0. In what follows, if F is a fluent for- 
mula, then we say that a set of ffuent formulas F entails F if F is true 
in all states satisfying F. By Hi{F) we denote the set of all formulas thus 
entailed by F (that is, the theory of F). The derivable consequences of a 
context-effect pair given underlying causal relationships is then defined as 
follows. 

Definition 2.6.1. Let S and T be sets of entities and fluent names, respec- 
tively, and TZ a set of causal relationships. If F is a set of fluent formulas 
and 0 a set of fluent literals, then the theory induced by (F, 0) relative 
to TZ, written nin{F,0), is the smallest (wrt. set inclusion) pair {F,0) 
such that 
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1. If' C S' and 0 C 0; 

2. W = Th{'P); and 

3. for any instance e causes if ^ G 7^ such that <P G ^ and e G 0, we 
have g G 'P and g G 0. 

If “Th'R.{'P,0) = {P,^) arid F G'P , then this is denoted by (fP,0) II" 7?, ■ 

It is easy to see that 1 1-7^ is monotonic, that is, if'i C F 2 and 0\ C 6*2 
implies {F: (Fi,0i) lh7 ^F}C{F: (^2,02) \^nF} . 

Example 2.6.1. Let £ = {s]^,S2} and F = {up^, light°}, and let TZ consist 
of the causal relationships reflecting the causal dependencies between the two 
switches and the light bulb in Fig. 2.3, i.e., 

up(s]^) causes light if up(s2) -'up(s;[) causes ->light if T 

y • .1. vy j 

up(s2) causes light if up(s;[) -iup(s2) causes -ilight if T 

Now, consider the set Fi = {up(s;[) A up(s2)} along with 0 = {up(s]^)}. 
Then Thn{F\,0) = (77i({up(sj^)Aup(s2), light}), {up(si), light}); hence, 
{'Fi,0) 11-7^ light. In contrast, suppose F 2 = {up(si) A -•light} and, as be- 
fore, 0 = {up(si)}, then ViTz{'p 2 , 0) = (77i({up(si) A -ilight}), {up(si)}) 
since no causal relationship is applicable. Hence, (if'2,0) F7^->up(s2). ■ 

The notion of performing deduction on the basis of causal relationships 
shall be exploited for the construction of a flxpoint characterization of suc- 
cessor states which accounts for indirect effects while respecting causal in- 
formation. Informally speaking, suppose an action with direct effect E is 
performed in state S. Then a state T is considered successor iff the follow- 
ing holds. Set T includes E, T along with E and the underlying causal 
relationships do not entail an inconsistency, and each fluent change from S 
to T is grounded on some causal relationship. This last condition reflects 
the idea of minimizing change. 

Definition 2.6.2. Let (£,F,A,C) be a basic action domain and TZ a set 
of causal relationships. If S is a state and a an action, then a state T is 
causal minimizing-change successor of S and a iff the following holds: Set C 
contains an applicable action law instance a transforms C into E such that 

T = {£ : {{SnT)UE,E) \hn i} (2.11) 

that is, T is flxpoint of the function XT.{£: {{Sr\T)UE,E) ll"7^ £} given 
S and E. ■ 

Example 2.6.2. Let T> be the basic action domain which models our cir- 
cuit composed of two switches and a light bulb of Fig. 2.3 as in Exam- 
ple 2.2.2. Furthermore, let TZ consist in the four causal relationships (2.10) 
above. Suppose that the current state be S' = {-•up(s;L), up(s2), -•light} 
as depicted in Fig. 2.3. The only applicable action law instance for ac- 
tion a = toggle(s]^) is toggle(sj^) transforms {-•up(sj^)} into {up(s;[)} 
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with effect E = {up(s;l^)}. Then the only causal minimizing-change succes- 
sor of S and a is T = {up(sj^),up(s 2 ), light}. For {{S T) \J E , E) = 
({up(s 2 ), up(s j^)}, {up(s^)l) allows to derive light. In contrast, the unin- 
tended T' = {up(s;[), -iup(s 2 ), -'light} does not satisfy Equation (2.11). 
For ((S' n T') \J E,E) = ({-■light, up(s;[)}, |up(s]^)}), which does not allow 
for deriving ->up(s 2 ), i.e., the literal which is missing to regain T ' ■ 

The following observation justifies formally our name “minimizing-change 
successor.” Namely, any state T satisfying Equation (2.11) has minimal dis- 
tance from S in that there is no state T' with less (wrt. set inclusion) 
changes while also satisfying this equation. 

Observation 2 . 6 . 1 . Let £ and T be sets of entities and fluent names, TZ 
a set of eausal relationships, S a state, and E a eonsistent set of fluent 
literals. For any two fixpoints T, T' of XT. {£ : ((S DT) U E, E) £} , if 
T\SDT'\S, then T = T' . 

Proof. Since each of S,T,T' is a state, T \ S = T' \ S implies T = T' . 
The assumption T \ S ^ T' \ S leads to the following contradiction. Let 

i G T \ S such that i ^ T' \ S , then t G T and £ ^ S, hence ~^£ G S 

and ~^£ G T' as both S and T' are states. Since T' is consistent and 

fixpoint, we have {{SC\T') U E, E) IFt^ £ due to ~^£ GT' . On the other hand, 

we know that {{S fl T) U E,E) \\-n(- due to ^ G T. Operator \\~tz being 
monotonic, ((S'nT')Uif, E) and {{S E , E) \\-n^ together imply 

S' n T 2 S n T' . Thus we can find some £' G S,T such that £' ^ T' . Then 
-■/ ^ S,T and -■£' G T', which contradicts premise T \ S G) T' \ S. Qed. 

Before we enter the analysis of the relation between minimization based 
on causal relationships on the one hand, and their operating on preliminary 
successors on the other hand, we present an iterative characterization of the- 
ories induced by causal relationships. This alternative view will be helpful for 
later purpose. 

Proposition 2 . 6 . 1 . Let £ and T he sets of entities and fluent names, 
respectively, and TZ a set of eausal relationships. For each pair {F, 0) con- 
sisting of a set of fluent formulas and a set affluent literals, we define 

1. Fq = F and 0 q = 0 . 

2. For i> Q, 

a) Fl = Th{Fi-i)\j{g : e causes piiF gTZ,F G Fi^i, g ^ Fi_i,e G 0i-i} 

b) 0i = 0i-i£){g : e causes g \j_L> G TZ, F G Fi^i, g ^ Fi^i, e G 0i-i} 

Then 'f' = Uto 0 = Ui^o where {F,0) = Th-ji{F,0) . 

It is also interesting to see why T" = {up(s;j^),up(s 2 ), -■light}, where only 
the direct effect is computed, does not satisfy (2.11): Pair {{S n T”) D E, E) — 
({up(s 2 ), -'light, up(s]^)}, (up(s;]^)}) additionally entails light with the help 
of TZ. Thus, T" is not a fixpoint. This illustrates that all material implica- 
tions F A e D Q induced by some causal relationship e causes g ii F hold in 
minimizing-change successors. 
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Proof. 

1 . We first prove W C 1J“ q 'Pi and 0 C 1J“ g 0i . 

a) Let F G 'P. Then minimality of {'P,0) implies the existence of 

a finite sequence £i causes gi if causes Qn if of 

causal relationships in TZ such that F G U {gi, . . . , 

and, for each 1 < t < n, we have <Pi G 77i(<? U {q\, . . . , Qi-i\), 

Qi ^ 77i(<? U {pi, . . . , (because of minimality of 0) and 

£i G 0 U {pi, . . . , Qi-i}. Consequently, F G 77i('f„+i) C if-. 

b) Let £ G 0. An argument similar to the one in clause (a) ensures that 

£ & 0n Q Uto some n G INg. 

2. To show F G) IJ^gifi and 0 G) we prove by induction that 

Pn Q 'P and 0n C 0 for all n G INg. The base case, n = 0, holds by 

definition since Fq = F GG F and 0q = 0 Q 0- Suppose n > 0 and 

assume that the claim holds for n — 1 . 

a) If F G 7Ti{Fn-i), then the induction hypothesis F^-i Q F implies 
F G Th{F) = F. 

b) If F = p for some £ causes g \f_F G TZ such that <P G >f„_i and 
£ G 0„_i, then the induction hypothesis F„_i C F and 0„_i C 0, 
respectively, implies F G F and e G 0. Consequently, g G F and 

Qed. 

We are now prepared for a formal proof of the announced subsumption 
result. That is, we will show that the successive application of causal relation- 
ships to preliminary successors enables us to encounter all causal minimizing- 
change successors. 

Theorem 2.6.1. Let (S,F,A,C) be a basic action domain and C and TZ 
be sets of state constraints and causal relationships, respectively. Furthermore, 
let S be an acceptable state and a an action. Then each causal minimizing- 
change successor of S and a is also causal successor state. 

Proof. Let F be a causal minimizing-change successor obtained through ac- 
tion law a transforms C into E G C. Let F = {S C\ T) U E and 0 = E, 
then T = {£ : ((F fl F) U F, F) If- 7 ^ £} implies Th{T) = F = lj“g Fi where 
{F,0) = Thn{P, O). 

We prove by induction that, for each n G INg, ((S'\C')UF, F) (S'n, F„) 
for some (S'„,F„) such that C 77i(S'„) and 0„ = F„. Notice that each 
Fn {n > 0) is consistent as C Th{T) and F is a state. 

In case n = 0, let Sq = {S\C)U E and Fg = F, which together satisfy 
the claim: Consistency of Fq = (S DT) U E and {S\C)U E being a state 
implies (S'nF)UF C (S'\C')UF, hence Fg C Sq. Set Fg equals 0g by 
definition. 

For the induction step, let n > 0 and suppose that some (S'„_i,F„_i) 
satisfies the claim. Let 
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{Ql,...,gm} = (2.12) 

be the set of all fluent literals that are added to to obtain 'I'n- Then 

there exist m causal relationships £i causes if , £m causes Qm if 

such that <Pj G 'f'n-i, Qj ^ and Sj G 6>„_i for each I < j < m. 

Consider the first relationship, i.e., ri = £i causes gi if According to 
the induction hypothesis, we know 'I'n-i Q 77i(S'„_i) and 6>„_i = En-i- 
Consequently, is true in S'„_i and £i G En-i- Moreover, we have that 
-'Qi G Sn-i, for assuming the contrary leads to the following contradiction. 
Given that gi G in case gi G En-i we find that gi G tffn-i due to 

E„-i = 0„-i and 0„_i C — which contradicts gi ^ 'f'n-i according 

to Equation (2.12). In case gi ^ E„_i, we have gi G Sn-i implies gi G S. 
From gi G En C U“g Ei = T it follows that gi G {S (IT) C Eq C En-i — 
which again contradicts gi ^ according to Equation (2.12). 

Thus we can apply relationship ri to the state-effect pair (S'„_i, E„_i). 
Consider, now, the second causal relationship V 2 = £2 causes g 2 if f^ 2 . Just 
like ri, relationship £2 is applicable to (S'„_i, E„_i) . We have to prove 
that it is still applicable after having applied ri. Condition £2 G E„_i 
does not become violated, for otherwise we had £2 = ~'Pi, which would 
imply ->gi G En-i = On-i Q Tn-i C En, which contradicts gi G tffi 
according to Equation (2.12). Condition - 1^2 G 5'„_i as well does not be- 
come violated, for otherwise we had gi = g^, which contradicts the left 
hand side of Equation (2.12) being a set. Finally, condition <^2 G 77i(S'„_i) 
does not become violated because <?2 G and C 77i(S'„_i) imply 

<^2 G 77i((S'„_i \ {-'Pi}) U {gi}) since does not contain gi nor -^g^. 

Thus we can successively apply all m causal relationships to (Sn-i, E„_i). 
The overall resulting state-effect pair (S'„, E„), where 

Sn — (‘5'n — 1 \ ... 5 ~'Qm}) C {^^1, . . . , Pm} 

En = Eji—i U {pi, . . . , Pm} 

satisfies the claim because tf'n C 77i(S'„) and 0^ = En- 

Now, since there exists only a finite number of changes from S to T , 
we have T = {(. : I G U^o ^*} = {f • f £ ^«} for some smallest number 
n G INq. The induction proof ensures the existence of some (Sn,En) such 
that En C Th{Sn) ■ Because {£: £ G En} = T is a state, this implies T = Sn- 
Consequently, {{S \ C) U E,E) (T,En), that is, T is causal successor 
state. Qed. 

While this result shows that causal relationships allow to obtain any 
causal minimizing-change successor, the converse is not true. That is, there 
may exist causal successor states which do not obey the request for mini- 
mizing change. To illustrate this, we further extend (for the last time) our 
electric circuit. This new example shows that minimizing change might be a 
policy too restrictive to account for all possible successor states. It is obvi- 
ously necessary to consider a non-deterministic action to this end, for there 
must be at least two successor states, one of which is non-minimal. 
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Example 2.6.3. The electric circuit of Example 2.3.2, depicted in Fig. 2.4, 
is slightly modified and further augmented by a light detecting device that 
becomes and stays activated as soon as light turns on; see Fig. 2.8. Accord- 
ingly, the basic action domain of Example 2.3.2 is extended by fluent name 
detect*^. The new arrangement is formalized by these four state constraints: 

light = up(si) A up(s2) 
relay = up(si) A up(s 3 ) 
relay Z) -iup(s 2 ) 
light D detect 

Obviously, potential influence is as in our predecessor circuit but with 
additionally having that the state of the light bulb may affect the de- 
tector. After enhancing influence information X of Example 2.3.2 by pair 
(light, detect), the above state constraints determine the following causal 
relationships TZ according to our generation procedure. 



up(s]^) causes light if up(s 2 ) 
^up(s]^) causes flight if T 



up(s 2 ) causes light if up(sj^) 
^up(s 2 ) causes -flight if T 



up(s]^) causes relay if up(s 3 ) 
->up(s]^) causes ->relay if T 



up(s 3 ) causes relay if up(sj^) 
-iup(s 3 ) causes -irelay if T 



relay causes -iup(s 2 ) if T light causes detect if T 

Now, suppose we toggle the first switch, S;^, in the state depicted in Fig. 2.8. 
What is the expected outcome? Obviously, the relay gets activated and, then. 




Figure 2.8. A modified electric circuit (c.f. Fig. 2.4) augmented by a device, repre- 
sented by fiuent detect, which registers an activation of the light bulb (this device 
combines a phototransistor and fiipfiop). It is assumed that no action of light has 
occurred yet. 
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attracts the second switch, sw2- Hence, the latter is open in the finally re- 
sulting state. Notice, however, that as soon as the first switch is in the upper 
position, the sub-circuit involving the light bulb gets closed. This may ac- 
tivate the light bulb for an instant, that is, before the second switch jumps 
its position as a result of the relay activation. If this is indeed the case, 
then this ‘flash’ might be registered by the photo device. Hence, while it is 
clear that the light bulb is off in the resulting state, fluent detect possi- 
bly becomes true.^"^ Accordingly, two causal successor states exist for ini- 
tial state {-iup(s]^), up(s2), up(s3), -ilight, -relay, -idetect} and action 
toggle(s;]^), which are obtained as follows. The unique preliminary successor 
state along with the corresponding direct effect is 

({'ip(si),up(s2),up(s3), -.light, ^relay, -.detect}, {up(si)j) (2.14) 

The first component violates both the first and the second constraint of (2.13). 
There are several ways to proceed. First, we can apply causal relationship 
up(s;[) causes relay if up(s3) followed by relay causes -.up(s2) if T, which 
results in 



( {up(si),-.up(s2),up(s3), relay, -.light, -.detect} , 

{up(si), relay, ^up(s2)} ) 

The first component satisfies all of the state constraints and, consequently, is 
a causal successor state. 

Another possibility to process the state-effect pair (2.14) is to apply the 
following sequence of causal relationships: 

up(sj^) causes light 
up(s;[) causes relay 
relay causes -.up(s2) 

-.up(s2) causes -.light 

In words, we first conclude the light bulb turns on due to the second switch 
being on. However, since the activation of the relay causes up(s2) to become 
false, we have to ‘turn off’ the light bulb again via the finally applied causal 
relationship. In this way we obtain the following overall resulting state-effect 
pair. 

( {up(si),-.up(s2),up(s3), relay, -.light, ^detect} , 

{up(si), relay, -.up(s2), -.light} ) 

Though different from the chain of causal relationships we considered first, 
the extended chain results in the identical causal successor state. But we have 

What actually happens depends on the physical properties of both the relay and 
the light detector. Lacking this precise information, common sense interprets 
the situation as giving rise to non-determinism. See the following section for a 
detailed discussion on this phenomenon. 



if up(s2) 
if up(s3) 
if T 
if T 



(2.15) 
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not considered all possibilities yet. Notice that light becomes true and is 
among the current effects after having applied the first one of the causal 
relationships (2.15), and it stays true until the last one is applied. Therefore 
we may insert causal relationship light causes detect if T somewhere 
in between. This additionally causes detect to become true, that is, the 
following state-effect pair can also be generated: 

( {up(si),^up(s2),up(s3), relay, -.light, detect} , 

{up(s;]^), detect, relay, -.up(s2), -.light} ) 

Its first component is acceptable and different from the causal successor 
above. No further causal successor states can be obtained. 

Now, the first one of the two successors, i.e., 

Ti = {up(si),-.up(s2),up(s3), relay, -.light, ^detect} 

is strictly closer to initial state S than the second one, i.e., 

T2 = {up(si),-.up(s2),up(s3), relay, -.light, detect} 

This indicates that the latter cannot be a causal minimizing-change successor. 
Indeed, while Ti is fixpoint of XT.{£: {{Sr\T)UE,E) \\--jz (with E = 
{up(s;[)}), state T2 is not because 

{{S nT2)VJ E,E) = ({up(s3), -.light, up(si)}, {up(si)} detect 



This domain proves that minimization is not always a concept adequate for 
distinguishing between possible indirect effects on the one hand, and un- 
founded changes on the other hand. This observation challenges the common 
belief that minimizing change is essential for solving the Ramification Prob- 
lem. In fact, the aim of generating ramifications is not to minimize change 
but to avoid changes that are not caused, which, as we have seen, need not 
be identical goals. 



2.7 Triggered vs. Coupled Effects 

The instructive example concluding the preceding section shows how causal 
relationships often literally run a race. A different finish sometimes means 
a different successor state, thus giving rise to uncertainty in form of non- 
determinism. This raises the question whether it might be overly credulous 
to consider possible any order in which these relationships are applied. Not 
all computed chains of indirect effects may be equally likely to happen in 
reality. This would be the case if the causal lag between some particular 
indirect effect and its triggering cause is generally shorter than the causal lag 
between another particular effect and its cause. Suppose, for instance, our 
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relay is known to take its time for warming up until it reaches the power to 
attract the switch, whereas the light detector registers the shortest flash with 
utmost alertness. Then it seems unreasonable to consider the possibility that 
the detector stays off when toggling switch in the situation depicted in 
Fig. 2.8. 

One approach to this problem is to introduce an explicit notion of time, 
namely, in specifying the exact delay between the occurrence of an effect and 
its cause. This, however, is not in the spirit of the Ramification Problem. 
For the latter is concerned with accounting for those indirect effects which 
so rapidly follow the performance of an action that common sense consid- 
ers them virtually instantaneous. Had we assumed precise knowledge of the 
relay’s delay, then its attracting switches fell into the category of so-called 
“delayed” effects. Though certainly of importance, too, these effects need to 
be meticulously distinguished from those indirect effects investigated in the 
context of the Ramification Problem. For delayed effects deserve a separate 
state transition, that is, they should not occur during the same transition step 
as its triggering effect. In contrast, solving the Ramification Problem means 
to account for all indirect effects that are summarized in a single state tran- 
sition. This amounts to performing qualitative reasoning about causal lags, 
as opposed to quantitative reasoning, which would require precise knowledge 
of virtually indistinguishable time intervals. Qualitative reasoning, which ac- 
knowledges the fact that common sense often lacks precise knowledge, con- 
siders equal all causal lags greater than zero. 

The very last remark, however, reveals one critical aspect which we 
have neglected hitherto. Causal lags may indeed differ even qualitatively, 
namely, in possibly being zero. An important category of indirect effects 
obviously of this nature is effects triggered by so-called definitional state con- 
straints, which we have already touched upon in Section 2.4. Recall constraint 
Va;[down(x) = -'up(a;)]. Having some instance like up(s]^) as direct or indi- 
rect effect, this gives rise to additional indirect effect -idown(s]^). The causal 
lag between these two effects is zero — not even for the tiniest fraction of time 
a state is imaginable where up(s;]^) already and down(sj^) still hold. The to- 
tal absence of a causal lag qualitatively distinguishes this ramification from 
most ones we have considered throughout the previous sections. Let us call 
coupled all effects which occur with causal lag zero, as opposed to triggered 
effects. Failing to account for this distinction may lead to surprising conclu- 
sions even if granted that, as we have argued, temporal differences between 
causal lags are to be neglected. The following scenario illustrates why coupled 
and triggered effects must not be treated alike. 

Example 2.7.1. Suppose a bowl well filled with soup is standing on a rectan- 
gular table; see Fig. 2.9. Whenever the left hand side of the table is lifted up 
but not the right hand side (or vice versa), then the spilling soup stains the 
tablecloth. If, however, both sides are lifted up simultaneously, then no soup 
is expected to spill out. Let us model this scenario by a basic action domain 
consisting of entities Ihs and rhs (left and right hand side of the table). 
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-istain 




Figure 2.9. A bowl filled with soup is standing on a table. The soup spills out and 
produces a stain if the table is lifted on one hand side but not the other. Nothing 
of this sort is expected when lifting up the table on both sides simultaneously. 



fluent names up^, down^, and stain*^, action name lift-both-sides°, and 
action law 

lift-both-sides transforms {down(lhs), down(rhs)} 

into {-'down(lhs), -idown(rhs)} 

Furthermore, let the set of state constraints consist of 



Va;[down(a:) = -'up(a;) ] 

up(lhs) A -iup(rhs) V up(rhs) A -iup(lhs) D stain 

Influence information 1= {(up(a;), down(x)), (down(x), up(x)), (up(a;), stain)} 
plus the constraints determine the following causal relationships: 



down(x) causes -'up(a;) if T 

up(x) causes -idown(a;) if T 

^down(x) causes up(x) if T 

->up(x) causes down(a;) if T 



up(lhs) causes stain if ^up(rhs) 

up(rhs) causes stain if -iup(lh.s) 

->up(rhs) causes stain if up(lhs) 

-iup(lhs) causes stain if up(rhs) 



(The interested reader may verify this being the output of our algorithm of 
Fig. 2.6.) 

Now, consider the state where the table stands firmly on the floor and 
there is no stain, S = {-■up(lhs), down(lhs), -iup(rhs), down(rhs), -istainj, 
as depicted in Fig. 2.9. Performing lift-both-sides results in the prelim- 
inary successor 

S' = {-iup(lhs), -idown(lhs), -iup(rhs), -idown(rhs), -istain} 

obtained through the direct effect if = {->down(lhs), -idown(rhs)}. This pre- 
liminary state violates the state constraints. The two possible instances of 
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causal relationship ^down(a:) causes up(a:) ^ T are applicable to the state- 
effect pair (S',E). Suppose we first apply instance {x ihs}. This yields 
the state-effect pair 

( {up(lhs), ->down(lhs), -iup(rhs), -idown(rhs), -istain} , 
{-idown(lhs), -idown(rhs), up(lhs)} ) 

We can proceed with the second instance, {x i— >■ rhs}, of the above relation- 
ship, thus obtaining {up(lhs), -idown(lhs), up(rhs), -idown(rhs), ->stain} as 
a causal successor state. Yet we can also first insert the application of 
up(lhs) causes stain if -iup(rhs). Followed by the conversion of ->up(rhs), 
we thus obtain {up(lhs), -idown(lhs), up(rhs), ->down(rhs), stain} as an- 
other possible causal successor, where surprisingly a stain has been produced! 
The identical two causal successor states can be obtained in case relationship 
instance -idown(rhs) causes up(rhs) if T is applied first, allowing to produce 
a stain via up(rhs) causes stain if -iup(lhs). ■ 

The general problem which causes the unexpected possibility of a stain 
here, is that causal relationships introduce an artificial causal lag between 
coupled effects. Some other causal relationship may exploit this lag for 
‘squeezing in’ a triggered effect which can never occur in reality. As a conse- 
quence, our approach to the Ramification Problem needs to be refined in that 
it respects the distinction between virtually and truly instantaneous indirect 
effects. 

To this end, some state constraints are distinguished as being steady. All 
definitional constraints enjoy this property, but others might as well. Con- 
sider, as an example, the situation depicted in Fig. 2.10. The corresponding 
state constraint Vx [ location(a, x) = location(b, x) ] is to be regarded as 
steady. The criterion for characterizing a state constraint as steady is that not 
even for an instant a situation is imaginable where this constraint is violated. 
All indirect effects determined by steady state constraints have causal lag 
zero, i.e., are coupled. As argued, during the application of causal relation- 
ships the insertion of an effect with real causal lag in between the generation 
of coupled effects needs to be prohibited. This is achieved by first apply- 
ing only causal relationships stemming from steady state constraints, until 
none of these constraints is violated. Only thereafter a triggered effect may 
be generated, again followed by accounting for all coupled effects necessary 
to satisfy the steady constraints, and so on until an overall acceptable state 
obtains. This strategy is formalized in the following definition of successor 
state. 

Definition 2.7.1. Let S, T, A, and C besets of entities, fluent names, ac- 
tion names, and action laws, respectively. Furthermore, let C he a set of state 
constraints and TZ a set of causal relationships, and let Cg Q C and TZg C TZ 
be designated sets of steady constraints and relationships, respectively. If S 
is an acceptable state and a an action, then a state S' is a successor of S 
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loc2 



Figure 2.10. These two blocks are connected by a firm bar. Whenever one of them 
changes its location, e.g., from locj^ to loc 2 , then the other block moves as well. 
Not even for an instant can the two blocks occupy different locations. Hence the 
state constraint Va; [ location(a, a;) = location(b, a:) ] is to be considered steady. 



and a iff the following holds: Set C contains an applicable action law in- 
stance a transforms C into E and there exist states Sq, Sq, . . . , S„, S!^ and 
sets affluent literals Eq,Eq, . . . ,En,E'„ (n>0) such that Sq = {S\C)U E , 
Eq = E, 

(So,Eo) iS^E',) ^^ . . . {S'„,E'J 

and, for each Q <i <n, S[ is acceptable wrt. Cs and S'„ is acceptable, m 

Example 2.7.2. Consider S = {lhs,rhs}, T = {up^, down^, stain®}, and 
A = {lift-both-sides®}, and let £ consist of 

lift-both-sides transforms {down(lhs), down(rhs)} 

into {-idown(lhs), -idown(rhs)} 

as above. Furthermore, let Cg = {Vx [down(x) = -'up(a;) ]}, and let C be Cg 
plus up(lhs) A -iup(rhs) V up(rhs) A -iup(lhs) D stain. Then the steady 
causal relationships T^s are 



down(cc) causes -■up(a:) if T 
->down(a;) causes up(a;) if T 

and 7?. is 'R.g plus 

up(lhs) causes stain if -iup(rhs) 
up(rhs) causes stain if ^up(lhs) 



up(cc) causes -idown(a;) if T 
->up(a;) causes down(a;) if T 

-iup(rhs) causes stain if up(lhs) 
->up(lhs) causes stain if up(rhs) 



As above, suppose S = {->up(lhs), down(lhs), -iup(rhs), down(rhs), -istain} 
be the current state. Performing lift-both-sides produces the following 
state-effect pair. 



( {->up(lhs), -'down(lhs), ->up(rhs), -idown(rhs), -istain} , 

{-idown(lhs), -idown(rhs)} ) 
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Its first component violates Cg, which is why first an acceptable state with 
this regard is to be found by means of TZg- This can only be achieved by 
applying, in either order, the two instances of -idown(a:) causes up(a:) T T. 
The resulting state-effect pair is 

( {up(lhs), -idown(lhs), up(rhs), -idown(rhs), -istain} , 
{-idown(lhs), -idown(rhs), up(lhs), up(rhs)} ) 

No further causal relationship, steady or non-steady, is applicable. The first 
component satisfies the entire set of state constraints C, hence constitutes 
the unique successor state. ■ 

Since now we finally arrived at a satisfactory solution to the Ramification 
Problem, let us extend the formal concepts of action domains, scenarios, and 
ent ailment so as to be able to specify and reason about domains involving 
indirect effects of actions. 

Definition 2.7.2. d. ramification domain V is a 6-tuple {£,iF,A,C,C,TZ) 
where {£, T, A, L) constitutes a basic action domain, C is a set of state 
constraints with a designated subset Cg of steady constraints, and TZ is a set 
of causal relationships with a designated subset TZg of steady relationships. 
The transition model of T> is a mapping S from pairs of an acceptable state 
and an action into (possibly empty) sets of states such that S' € S{S,a) iff 
S' is successor of S and a. m 

As solutions to the Ramification Problem are only concerned with the so- 
phistication of transition models, all concepts beyond can be adopted without 
modification. For the sake of completeness, let us reiterate them. 

Definition 2.7.3. A ramification scenario is a pair {0,T>) where T> is a 
ramification domain and O is a set of observations. An interpretation for 
{0,T>) is a pair {S,Res) where S is the transition model of V and Res is 
a partial function which maps finite (possibly empty) action sequences to 
acceptable states and which satisfies the following: 

1. Res([]) is defined. 

2. For any sequence a* = [oi, . . . ,afe_i,afc] of actions (k > 0), 

a) Res{a*) is defined if and only if Res{[ai, . . . ,ak-i\) is defined and 
S{Res{[ai , . . . , afe_i]), a^) is not empty, and 

b) Res{a*) € E{Res{[ai, . . . ,ak-i]),ak) . 

A model of a ramification scenario {0,T>) is an interpretation {E,Res) 
such that each o € O is true in Res, and an observation o is entailed, 
written O o, iff it is true in all models. m 

As a small exercise, the reader may consider the ramification domain mod- 
eling the electric circuit with the relay of Fig. 2.4, along with the scenario 
given by these two observations: 
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up(s 2 ) after [] 

^relay after [toggle(s 3 )] 

What can be concluded as to the initial state of the light bulb here? That 
is, is either one of the two observations light after [] or -ilight after [] 
entailed?^® 



2.8 Implicit Qualifications vs. Indirect Effects 

Sometimes the available causal relationships fail to produce even a single 
acceptable state. Rather than indicating an erroneous specification, the non- 
existence of a successor state tells us something about the executability of 
the action at hand. Recall that if no preliminary successor state exists, that 
is, if no action law is applicable, then this is an indication that some action 
precondition is not met. This interpretation transfers to the case where a 
preliminary successor exists but no overall successor state. Again, the action 
at hand is, for some reason, not qualified. That is, aside from the precondi- 
tions explicitly mentioned in action laws, additional, implicit preconditions 
are being revealed by the failure to generate an acceptable state. In causing 
the unsatisfiability of the state constraints, these supplementary action qual- 
ifications can be said to derive from state constraints — ^just like ramifications 
do. Although implicit preconditions are somehow opposite to indirect effects, 
being able to encounter them is vital when solving the Ramification Problem. 
For if a successor state is suggested in case some implicit qualification is not 
met, then this state is necessarily based on indirect effects which could never 
occur in reality. 

An exemplary scenario shall emphasize the distinction between implicit 
qualifications and indirect effects. An interesting feature here is that one and 
the same state constraint gives rise to both qualification and ramification. 

Example 2.8.1. The following is an extended version of the famous Yale 
Shooting domain, which describes a turkey-hunting. Suppose given the only 
entity turkey, the two fluent names alive^ and fleeing^, and the two 
action names shoot^ and startle^, which are accompanied by these two 
action laws: 

shoot(a;) transforms {alive(a;)} into {->alive(x)} 

startlefx) transforms j-'f leeine:fa;)l into Ifleein six)} 

The right answer is that light after M is entailed: Let {E,Res) be any model 
of the scenario. The two state constraints relay = ~iup(s;j^) A up(s3) and 
relay D -iup(s2) in conjunction with up(s2) € Res([]) imply that up(s]^) V 
-iup(s3) G i?es([]). Now, had we -mp(s]^) G 7 ?es([]) it would follow that 
-iup(s3) G Res([]) and, hence, up(s3), -iup(s]^) G Res([toggle(s3)]), which 
contradicts -irelay G Res([toggle(s3)]) . Thus we find that up(s;j^) G Res([]), 
which, given up(s2) G Res([]), implies light G i?es([]) according to state 
constraint light = up(s]^) A up(s2). 
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j alive(t), 

I -ifleeing(t) 



shoot (t) 



f ^alive(t), 1 
I ^fleeing(t) J 




Figure 2.11. Shooting the turkey (t) always has the direct effect -ialive(t) . If 
the turkey is in motion, then the action additionally causes -if leeing(t) . 



The (steady) state constraint Va;[f leeing(x) D alive(x)] restricts fleeing 
subjects to vivid ones. The correct information of influence in this context is 
I = {(alive(a;), f leeing(a;))}, that is, a change of alive(x) might affect the 
truth- value of f leeing(a;) but not vice versa. The constraint thus determines 
a single causal relationship, viz. 

-ialive(a;) causes -if leeing(a;) if T (2.16) 

Suppose action shoot (turkey) is performed in a state where the victim is 
still alive. In case the initial state is {alive(turkey), leeing(turkey)}, 
it suffices to generate the direct effect, {-ialive(turkey)}, of the corre- 
sponding action law instance. If, on the other hand, the initial state is 
{alive(turkey), f leeing(turkey)}, then the available causal relationship 
needs to be applied to accommodate an additional effect: Not only does the 
turkey drop dead, it also stops fleeing. The two situations are illustrated in 
Fig. 2.11. We see that our state constraint may give rise to indirect effects. 

In contrast, suppose we perform action startle(turkey) in a situation 
where the turkey idles. As above, if {alive(turkey), -if leeing(turkey} is 
the initial state, then the direct effect, f leeing(turkey) , of the correspond- 
ing action law instance suffices to obtain an acceptable state. On the other 
hand, consider initial state S = {^alive(turkey), ->fleeing(turkey)}. 
The only preliminary successor of S and action startle(turkey) is state 
S' = {-ialive(turkey), f leeing(turkey)}, obtained through the direct 
effect E = {fleeing(turkey)}. State S' violates our constraint. More- 
over, the only available causal relationship is not applicable here since 
-■alive(turkey) ^ E. Consequently, no successor state exists. In other words, 
our state constraint enforces the additional precondition that the turkey must 
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Figure 2.12. Startling the turkey is possible only if the animal is alive. This im- 
plicit precondition is encountered by failing to transform the preliminary successor 
{-ialive(t), fleeing(t)} into an acceptable state via the only available causal re- 
lationship (2.16). 



be alive if we want to startle it with success; see Fig. 2.12. We see that our 
constraint may also give rise to implicit qualifications. ■ 

This completes our formal account of the Ramification Problem. Let us 
summarize: A ramification domain contains a number of state constraints, of 
which some are steady, that give rise to indirect effects of actions. With the 
aid of general information as to possible influences among the fluents, the 
potential indirect effects, formalized as so-called causal relationships, can be 
automatically extracted from the constraints. This helps to distinguish the 
correct effects from both unmotivated changes and implicit qualifications. 
Causal relationships are successively applied to preliminary successor states 
until all indirect effects have been accounted for; in the course of this process 
the so-called coupled effects, deriving from steady state constraints, are to 
be generated prior to any so-called triggered effect. 



2.9 A Fluent Calculus Axiomatization 

In this section, we develop an axiomatization of our action theory by means 
of standard logic. The major motivation for so doing is that action theo- 
ries, as they stand, are less suited than general purpose logics when it comes 
to automating reasoning. The reason is that entailment relations of action 
theories are usually much more specialized and sophisticated because they 
reflect complex notions such as time, change, and causality. Moreover, when- 
ever an action theory is modified or extended in order to address additional 
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ontological aspects, then this necessitates a change of the original entailment 
relation. Axiomatizing the way conclusions are drawn from action scenarios 
by means of some general purpose logic provides an elegant circumvention of 
this problem. Any modification or extension of an action theory then amounts 
to modifying or extending the axiomatization; it avoids both changing the 
semantics of the underlying logic and adapting the inference engine. 

Our axiomatization shall be based on classical logic in conjunction with a 
representation technique called Fluent Calculus. The distinguishing feature 
of this technique is a special way of formalizing states, which allows high- 
est flexibility in view of defining how states are manipulated. This principle 
is introduced in the following Section 2.9.1. Section 2.9.2 is then devoted 
to axiomatizing action laws and causal relationships, and in Section 2.9.3 
we formalize all remaining notions, viz. observations, interpretations, and 
models of action scenarios. The entire axiomatization is accompanied by a 
mathematical proof of its correctness wrt. our action theory. 

2.9.1 Reifying States 

The common principle of Fluent Calculus-based formalizations of action the- 
ories is a particular axiomatization of the fundamental entity, namely, the 
state. A state is encoded as a term, accordingly called state term, which con- 
tains as sub-terms the fluent literals that are true in that state. This enables 
the use of first-order formulas to define how state terms evolve in the course of 
time. Representing fluents by terms instead of predicates follows a standard 
technique in logic known as reification, which generally allows for a restricted 
form of meta-reasoning without appealing to higher-order logic nor to exten- 
sions of classical logic. Sub-terms representing fluent literals compose state 
terms by employing a special binary function, which is illustratively denoted 
by the symbol “ o ” and is written in infix notation. For example, a term 
representation of the state {-'up(s]^), up(s 2 ), “ilight} is 

(->up(s]^) o up(s 2 )) o -ilight (2-17) 

where formally each n-place fluent becomes an n-place function, each entity 
becomes a constant, and where the negation symbol is a distinguished unary 
function. 

Obviously, it should be irrelevant at which position a fluent literal occurs 
in astate term. That is, (2.17) and up(s 2 )o(-iliglito-iup(s]^)), say, represent 
identical states. Moreover, double application of function -■ should always 
neutralize. This intuition is formally captured by stipulating the following 
formal properties of our special functions: 



'ix,y,z. 


II 

O 

o 

& 


o 

o 


(associativity) 


'ix,y. 


X O y = 


y o X 


(commutativity) 


Vx. 


X o 0 = 


X 


(unit element) 


Vx. 


-i-ix = 


X 


(double negation) 
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where the special constant 0 denotes a unit element for function o. This 
constant will prove useful as formal counterpart of the empty collection of 
fluent literals. The four axioms (ACIN, for short) constitute an equational 
theory. Given the law of associativity, from now on we omit parentheses on 
the level of o. Notice that the axioms ACIN formalize essential properties 
of the mathematical concept of a set: The order in which set elements are 
enumerated is irrelevant, too. From this perspective, constant 0 closely cor- 
responds to the empty set.^® For formal reasons, we introduce a function r 
which maps finite sets of fluent expressions A = {£i, . . . ,£„} to their term 
representation = t'l o • • • o (including = 0). 

In conjunction with the standard axioms of equality, equational the- 
ory ACIN entails the equivalence of two state terms whenever they are built 
up from an identical collection of fluent literals. By standard equality axioms 
we mean the following collection: 



X = X 

X = y D y = X 

X = y A y = z D x = z (2-18) 

Xi = y D f{xi,...,Xi,...,Xn) = f{xi,...,y,...,Xn) 

Xi = y D [P{xi, ...,Xi,...,Xn)= P{xi, ...,y,..., x„)] 

for each n-place function symbol / and predicate P (n > 1), and for each 
1 < i < n. All variables are universally quantified. 

ACIN plus these standard axioms alone, however, will not suffice for an 
axiomatization of our action theory. For it will also be necessary to prove 
unequal two state terms which do not contain the same fluent literals. That 
is, denials of equalities such as up(s;[)olight ^ lightoup(s 2 ) need also be 
derivable. Notice that this inequation is not entailed by the above axioms, for 
this would require the basic inequality ^ S 2 , which is nowhere granted. 
The standard so-called assumption of unique names provides us with these 
basic inequalities by means of additional axioms stating that syntactically 
different terms are never equal. The presence of an equational theory ne- 
cessitates a weakened version of this assumption. For up(s;]^) o light and 
light o up(s;[) , say, are syntactically different but considered equal by our 
axioms ACIN. A general concept known as unification completeness serves 
this purpose. For a formal introduction to unification complete theories see 
Annotation 2.1. The set of axioms which constitute an AClN-uniflcation 
complete theory allows to derive inequality of state terms not representing 
the identical collection of fluent literals. These axioms, which include ACIN 
and the standard axioms of equality, shall be called extended unique name 
assumption, abbreviated EUNA. 



The reader may wonder why we do not additionally reqnire fnnction o to 
be idempotent, i.e., V®. x o x = x, which the comparison with sets would 
naturally suggest. The (subtle) reason for this decision is given below, right 
after Proposition 2.9.1. 
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The formal definition of unification completeness requires some basic notions 
related to unification wrt. equational theories. Let E be an equational theory, 
i.e., a set of universally quantihed equations. Two terms s and t are said to be 
E-equal, written s =e t, iff s = t is entailed by E plus the standard equality 
axioms (2.18). A substitution a is called E -unifier of s and t iff sa =e ter ■ A 
set cUE{s,t) of iH-unifiers of s and t is called complete if it contains, for each 
i?-unifier of s and t, a more or equally general substitution. That is to say, for 
each substitution a such that sa =_b ta there exists some a' G cUE{s,t) with 
{a' <E cr) |var(s)uvar(t)- Here, Var{t) denotes the set of variables occurring in 
term t, and {o' <e <r)\v means the existence of a substitution 0 such that 
{o'6 =E o) \v ■ The latter holds iff for each variable x (z V the two terms {xo')0 
and xo are S-equal. A unification complete theory wrt. E is a consistent set 
of formulas E* containing the following: 



1. The axioms in E. 

2. The standard equality axioms (2.18). 

3. Equational formulas, i.e., formulas with “ = ” as the only predicate, such that 
for any two terms s and t with variables x the following holds: 

a) If s and t are not E-unifiable, then E* |= Sx. s = t. 

b) If s and t are E-unifiable, then for each complete set of E-unifiers 
cUE{s,t), 



E* ^ VT 



s = t D \J 3y. 
(tGcUe (s,t) 



(2.19) 



where y denotes the variables which occur in cr= but not in x. By (j= 
we denote the equational formula xi = ti A . . . A x„ = tn constructed 
from substitution cr = {xi i— >■ ti, . . . , Xn >— >■ t„}. If a is the empty sub- 
stitution, then (7= evaluates to formula T. 



Unification wrt. theory ACIN is well-understood. Whether two terms are ACIN- 
unifiable is decidable in general, and unihable terms always admit a hnite (but not 
necessarily singleton) complete set of unifiers. Several complete AC 1-unification 
algorithms are known. Extension to ACIN is straightforward. A unihcation com- 
plete theory ACIN* can be obtained by computing, for any two terms s,t, some 
complete set cEacin)^,!) of AClN-unifiers and adding the corresponding equa- 
tional formula which is to the right of the entailment symbol in (2.19). As a 
small example, consider the terms up(x) o a and up(sj^) o up(s 2 ) ° -ilight. The 
set I— >■ S;]^ , a I— >■ up(s 2 ) o flight}, {a: I— >■ S 2 , 2 up(sj^) o -ilight}} is a com- 

plete set of AClN-unifiers of these terms. According to (2.19), axioms ACIN* 
therefore entail 



Vx, 2 [ up(x) o z — up(s]^) o up(s 2 ) o -'light D 

X = s-^ A z = up(s 2 ) o -ilight V x = S 2 A 2 = np(s]^) o -ilight ] 



Annotation 2.1. Unification completeness. 



Before entering the axiomatization of our action theory, we prove some 
crucial properties of EUNA. These properties will be exploited later on to 
model, on the term level, the subset relation and the two set operations 
difference and union. 
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Proposition 2.9.1. Let A and B be two sets of fluent literals. 

1- If A B, then BUNA |= 3z. ta°z = tb, else BUNA \= \/z. ta°z ^ tb- 

2- If A C B, then BUNA \= \/z[tao z = tb = z = tb\a ] ■ 

3. If AU\ B = {} , then BUNA ^ \lz\z = ta° tb = z = takjb ] ■ 

Proof. 

1. Suppose A C B and let Z = B \ A. Then ta o tz and tb are ACIN- 
equal. Since BUNA includes ACIN, it entails ta ° tz = tb, hence also 
3z.ta o z = Tb. Suppose A % B, then tao z and tb are not ACIN- 
unifiable. According to clause 3a in Annotation 2.1, this implies BUNA 
entails Vz. ta ° z ^ tb . 

2. Let z be an arbitrary term, then ta°z and tb are AClN-equal iff each 
ffuent literal occurring in ta° z also occurs in tb and vice versa and no 
ffuent literal occurs twice or more in ta o z. This in turn is equivalent to 
z and Tb\a being AClN-equal (given that A C B), hence is equivalent 
to z = Tb\a under BUNA. 

3. An arbitrary term z and the term ta°tb are AClN-equal iff each ffuent 
literal occurring in z also occurs in ta o tb and vice versa and no ffuent 
literal occurs twice or more in z (given that An B = {}). This in turn 
is equivalent to z and taub being AClN-equal, hence is equivalent to 
2 = taub under BUNA. 

For illustration, consider A = {up(s 2 )}, B = {-iup(s;L), up(s 2 ), “'light}, 
and C = {up(s;l^), light}. Then AC B and AnC = {}. Accordingly, 
BUNA entails each of the following. 

3z. up(s 2 ) o z = -'up(sj^) o up(s 2 ) o -ilight 

Vz [up(s 2 ) o z = -'up(s]^) o up(s 2 ) o -ilight = z = -'up(s;L) o -ilight ] 

Vz [z = up(s 2 ) o up(s;]^) o light = z = up(sj^) o up(s 2 ) o light ] 

Clause 2 of the proposition will be particularly useful when axiomatizing 
the application of action laws: Suppose C be the condition of some action 
law, S be some state, and z be a term such that BUNA ^ tq o z = ts, 
then we know that z represents the set S \C. Precisely this is the reason 
why idempotency of function o is not stipulated. For if it would be, then 
Tc o z = Ts would not imply z = Tg\c. Rather for any set A we would have 
taota = Ta . But clearly ta yf ta\a unless A = {}. In contrast, BUNA as 
it stands entails ta°ta ^ ta except for ta = 0, i-e., A = {}. 

With the extended unique name assumption, BUNA, and its properties 
we are prepared for defining the circumstances under which a term represents 
a state. In what follows, we tacitly assume a fixed set of entities and fluent 
names. We use a many-sorted logic language with five sorts, namely, entities, 
fluent literals, collections of fluent literals, actions, and sequences of actions. 
Collections are composed of fluent literals, constant 0, and our connection 
function o. Below, variables of sort entity are indicated by x, variables of sort 
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fluent literal by £, variables of sort action name by a, and variables of sort 
action sequence by a*, sometimes with sub- or superscripts. All other vari- 
ables are of sort collection. Free variables are implicitly assumed universally 
quantified. 

To begin, the following axiom defines a predicate Holds {£, s) with the 
intended meaning that £ is contained in s: 

Holds{£,s) = 3z.£oz = s (2.20) 

Let, for instance, s = -'up(s;]^) o up(s 2 ) o light, then Holds(wp{s 2 ), s) and 
-'Holds{vcp(s-\^), s) . The next axiom determines the constitutional properties 
of state terms. 

State{s) = ^£[Holds{£,s) = ^Holds{-^£,s)] f\y£,z. s ^ £o £o z (2.21) 

In words, s represents a state if it contains each fluent literal or its negation 
but not both. Furthermore, no fluent literal may occur twice (or more) in s. 
For example, term s above satisfies State{s) provided our action domain is 
based on exactly the three fluents of which s is composed. If so, then we also 
have -iState{-^wp{s-i) o light), -i5'tote(-'up(s;l^) oup(sj^) oup(s 2 ) o light), 
and -i5'tote(-'up(s]^) o -iup(s;]^) o up(s 2 ) o light). The following proposition 
justifies our definition of State: 

Proposition 2.9.2. For a collection s, EUNA, {2.2D), {2.21) \= State{s) iff 
EUNA \= s = Ts for some state S , else EUNA, (2.20), (2.21) \= -iState{s). 

Proof. We have Ff/AA, (2.20), (2.21) ^ Statefs) iff EUNA and the ax- 
ioms (2.20) and (2.21) entail 

{3z. £ o z = s = VzL -'£ o z' ^ s) A Vt. s ^ £o £o z (2.22) 



for each fluent literal £. 

Suppose EUNA entails formula (2.22) for each fluent literal £. Define 
S = {£ : EUNA |= 3z. £ o z = s}, then entailment of the first conjunct 
of (2.22) ensures that S' is a state. It also ensures that S consists of pre- 
cisely the fluent literals which occur in s. Entailment of the second conjunct 
of (2.22) moreover guarantees that no fluent literal occurs twice or more in s. 
Altogether this implies EUNA ^ s = rs. 

a 5? . 

Suppose EUNA ^ s = rs for some state S. Then all fluent literals in 
Ts occur exactly once in s. Let t' be a fluent literal. S being a state implies 
that {£} C S iff {-'£} 2 S. Thus, clause 1 of Proposition 2.9.1 ensures that 
the first conjunct of (2.22) is entailed. Moreover, since s does not contain 
any literal twice or more, s and £ o £ o z are not AClN-uniflable. This 
implies EUNA |= Vz. s ^ £o £o z according to clause 3a in Annotation 2.1. 
Altogether, it follows that EUNA entails formula (2.22) for each literal £. 

Qed. 
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Our next concern is to axiomatize the notion of acceptable states wrt. a 
set of state constraints. Based on the definition of predicate Holds , (2.20), 
the encoding of fluent formulas is straightforward. In order to state that 
some formula F is true in the state represented by some term s, each fluent 
literal £ occurring in F is replaced by the expression Holds {£, s). E.g., state 
constraint light = up(s;[) A up(s 2 ) becomes 

H olds {light, s) = Holds{vLp{si), s) A Holds{up{s 2 ), s) 

just like state constraint Vx [fleeing(x) D alive(x)] becomes 

Vx [ Holds{flee±ng{x), s) D Holds{a.l±ve{x), s) ] 

For notational convenience we will simply write Holds{F, s) as abbrevia- 
tion of the formula thus constructed which states that F is true in s. This 
encoding of fluent formulas is justified by the following proposition. 

Proposition 2.9.3. Let F be a closed fluent formula and S a state. 
Then EUNA, (2.20) \= Holds{F,Ts) if and only if F is true in S, else 
EUNA, {2.20) \= ~^Holds{F,Ts). 

Proof. A fluent literal £ is true in S iff {£} C S. Following clause 2 of Propo- 
sition 2.9.1, the latter is equivalent to EUNA ^ 3z. £oz = rg, which in turn is 
equivalent to EUNA, {2.20) \= Holds{£,rs) according to axiom (2.20). The 
claim follows by straightforward induction on the structure of formula F. 

Qed. 

In particular, a state term is Acceptahlcs if it satisfies the underlying steady 
state constraints Cs, and it is Acceptable if it satisfies the entire state con- 
straints C, that is. 



Acceptable g{s) = Holds{F,s) 

(2.23) 

Acceptable{s) = A Holds{F, s) 

Fee 

Suppose, for example, 

Acceptable{s) = [Holds{light, s) = Holds{Ti-p{si), s) A Holds{np{s 2 ), s)] 

then we can conclude, for instance. Acceptable {-'wp{s.\_) o up(s 2 ) o -ilight) 
and -<Acceptable{up{si) o -iup(s 2 ) o light). 
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2.9.2 Axiomatizing Action Laws and Causal Relationships 

Having defined a suitable representation of states, we now axiomatize two 
ways of state modification, namely, by means of action laws and causal rela- 
tionships. To begin, let 

{ = ai transforms Ci into Ei, . . . , a„[xn] = a„ transforms C„ into En } 

be the underlying set of action laws. These laws define the ternary predicate 
Action{a, c, e) , an instance of which is true if and only if there exists an action 
law for action a with condition and effect that is, 

n 

Action{a, c, e) = \J 3xi [a = ai A c = A e = ] (2.24) 

i=l 

E.g., the familiar two action laws for toggling a switch may be encoded as 

Action{a,c,e) = [a = toggle(cc) A ( c = -'up(a;) A e = up(x) 

V c = up(a;) A e = -'up(a;) ) ] 

which is a slightly more compact equivalent of the schematic instantiation 
of (2.24). 

Applicability and application of action laws are axiomatized with the help 
of the properties of EUNA elaborated in the previous section. An action 
law whose condition is represented by collection c is applicable to a state 
represented by term s if and only if some term z can be found such that 
CO z equals s under EUNA. For the latter has been proved equivalent to 
set being subset of . Entailment of co z = s also guarantees that 
z contains all fluent literals in s but not in c. A preliminary successor, or 
rather a representation thereof, is then obtained by concatenating z and the 
effect term e of the action law in question, yielding term z o e. Correctness 
of this axiomatization of applying action laws is guaranteed by the following 
proposition. 

Proposition 2.9.4. Let A he a set of action names, L a set of action 
laws, S a state, and a an action. Furthermore, let s' he a collection of 
fluent literals. Then 

EUNA, {2.24) \= 3c,e, z[Action{a,c,e) A CO z = s /\ s' = z o e] (2.25) 

iff there exists a preliminary successor S' of S and action a such that 
EUNA 1= s' = Ts' , else 

EUNA, (2.24) 1= -Ac, e, z [ Action{a, c,e) A co z = s A s' = zoe] 

Proof. Formula (2.24) for predicate Action in conjunction with the standard 
equality axioms in EUNA imply that (2.25) holds iff C contains an action 
law instance a transforms C into E such that 

EUNA 1= 3z[tc o z = s A s' = z o te] 



This in turn holds iff 
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1. C C S' (according to clause 1 of Proposition 2.9.1), and 

2. EUNA ^ s' = Ti^s\C)yjE (according to clauses 2 and 3 of Proposi- 
tion 2.9.1 given that {S\C)r\E = {} , which is due to ||C|| = |lif||). 

These conditions are equivalent to action law a transforms C into E being 
applicable to S and resulting in S' = {S\C)\J E where EUNA ^ s' = rg/; 
hence, the conditions are equivalent to S' being a preliminary successor of S 
and a. Qed. 

As an example, consider the state {->up(s;]^), up(s 2 ), -ilight} and the action 
law instance toggle(s;L) transforms {->up(s]^)} into {up(s]^)}. Then EUNA 
entails 

3^ [-iup(s]^) o 2 ; = -iup(s;[) o up(s 2 ) o -ilight A s^ = z o up(s ] 

with s' = up(s;[) o up(s 2 ) o -ilight, because z can be substituted by the 
term up(s 2 ) o -ilight. Notice that s' represents the expected preliminary 
successor state. 

Subsequent to the generation of some preliminary successor is the (pos- 
sibly repeated) application of causal relationships. Let the underlying set of 
steady relationships be 

{ ri\xi] = Si causes pi if ^i, . . . ,rm[xm] = £m causes gm 

These relationships define the ternary predicate Causals{(-e,^e^ ^ 
stance of which is true if and only if there exists a steady causal relationship 
with triggering effect and ramification £g and whose context holds in 
state s, that is, 

m 

Causala{iE,£g, s) = \J 35;^ = Si t\ lg = Qi t\ Holds{<Pi, s)] (2.26) 

i=l 

Likewise, let {ri[xi] = Si causes gi if <Pi, . . . ,r„[x„] = £„ causes gn if ^n} 
be the underlying entire set of causal relationships, which define predi- 
cate Causal as follows: 

n 

Causal{te,lg, s) = \J 3xi = Si A £g = gi A Holds {<l>i, s)] (2.27) 

i=l 

E.g., recall the following familiar four (non-steady) causal relationships. 

up(sj^) causes light if up(s 2 ) -■up(s;L) causes -ilight if T 

up(s 2 ) causes light if up(s;[) ^up(s 2 ) causes -ilight if T 

These are encoded as 

Causal{is,£g, s) = ig = wp{si) A £g = light A Holds{up{s 2 ), s) 

V 4 = up(s 2 ) Aig = light A Holds{np{si), s) , . 

V 4 = ~'up(s]^) A = -ilight ^ 

V 4 = ~'up(s2) A£g = -ilight 
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Application of (steady) causal relationships to state-effect pairs is axiom- 
atized by defining the predicates Causes s{s,e, s' ,e') and Causes{s,e, s' ,e') , 
respectively, an instance of which is true iff some (steady) causal relationship 
is applicable to (s,e) and yields {s',e'): 

CauseSfls,e, s' , e') = 

CaUSals{£e,^g,s) A 3v.£iOV = 

A 

3z [ -iQi o z = S A s' = Z o Qi] 

34 , \ A 

Vw. -'Qi ow^e A e' = eo Qi 

V 

3w[-<giOw = e A e' = w o Qi] 

This definition needs explanation. Let e causes g if <P he some steady causal 
relationship whose context, <P, holds in state s. Furthermore, let S, E, S' , E' 
be the sets of fluent literals represented by s,e,s',e'. Then the equational 
formula in the first row on the right hand side of the equivalence encodes 
condition e € E. The second row simultaneously models the two conditions 
-<g € S and S' = {S \ {“'p}) U {gi}. Finally, axiomatizing the condition 
that E' = {E \ {-'g}) U {p} requires case analysis: If -•g ^ E, then we 
just have to add g to the corresponding term e (third row). If, on the 
other hand, -'g G E, then we have to additionally remove the sub-term -'g 
from e (fourth row) . The definition of Causes is analogous but with Causal 
replacing Causalg ■ 

Causes{s,e, s' ,e') = 

Causal{£e, ig, s) A 3v.£iOv = e 
A 

3z [ -'Qi o Z = S A s' = Z o Qi] 

A 

Vw. -'Qi ow^e A e' = eo Qi 

V 

3w[-<giOw = e A e' = w o gi] 

Correctness of this axiomatization of causal relationships is given by the 
following proposition. 

Proposition 2.9.5. Let TZ he a set of causal relationships. Furthermore, let 
S he a state, E a set of fluent literals, and s' ,e! two collections of fluent 
literals. Then 

FCAA, (2.20), (2.27), (2.30) h Causes { ts,te, s' , e') (2.31) 

iff there exist two sets of fluent literals S',E' such that (S,E) {S',E') 

and EUNA ^ s' = ts' A e' = te' , else 

FCAA, (2.20), (2.27), (2.30) h ~^Causes{Ts,TE,s',e') 
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Proof. Axioms (2.27) and (2.30) imply that (2.31) holds iff TZ contains a 
causal relationship instance e causes g ii'P such that the conjunction on the 
right hand side of the equivalence in (2.30) is entailed. This in turn holds iff 

1. A-'Q is true in S (according to Proposition 2.9.3); 

2. s' = under EUNA (according to clauses 2 and 3 of Propo- 

sition 2.9.1); 

3. {e} C E (according to clause 1 of Proposition 2.9.1); and 

4. a) either {-'£»} % E and e' = teu{q} under EUNA (according to 

clauses 1 and 3 of Proposition 2.9.1), 
b) or else {-■e} C E and e' = under EUNA (according 

to clauses 1, 2, and 3 of Proposition 2.9.1). 

These conditions are equivalent to causal relationship e causes g \f(!> being 
applicable to (S,E) and resulting in (S',E') such that EUNA ^ s' = ts> 
and EUNA |= e' = te'- Qed. 

As an example, recall (2.28), where Causal is defined on the basis of 
the four causal relationships which describe the usual relation between the 
switches S;^, S 2 and the light bulb. Then 

CoMses(up(s]^) o up(s 2 ) o -ilight, up(s;[), 

up(s]^) o up(s 2 ) o light, up(s j^) o light) 

as can bee seen by the fact that Vw. flight o w ^ up(s;[) and by re- 
placements u !->■ 0 and t >->■ up(sj^) oup(s 2 ) in (2.30). Notice that this cor- 
responds to up(s]^) causes light if up(s 2 ) applied to the pair, 

({up(si),up(s 2 ), -■light}, {up(si)}) 



yielding 

({up(si),up(s 2 ), light}, {up(si), light}) 

Successor states result from repeated application of causal relationships 
to some preliminary successor. The procedure is to first apply steady re- 
lationships until a state obtains that satisfies all steady state constraints. 
Thereafter, a single arbitrary relationship is applied, followed again by a 
series of steady relationships, and so on until an overall acceptable state re- 
sults. The axiomatization of this ramification procedure is based on defining 
two predicates Ramifys{s,e, s' ,e') and i?amz/)/(s, e, s'). A valid instance of 
the former shall indicate the existence of a sequence of steady causal relation- 
ships whose application to the state-effect pair (s,e) yields (s',e') such that 
Acceptable s {s') holds. Likewise, a valid instance of the latter shall indicate 
that, for some e', pair (s',e') results from the aforementioned alternated 
application of steady and arbitrary relationships such that Acceptable{s') 
holds. In essence the two definitions are transitive closures. As this mathe- 
matical concept cannot be expressed in first-order logic, we use the standard 
way of encoding transitive closure by means of second-order formulas: 
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Ramify ^{s,e, s', e') = 

Vsi,ei. iT(si, ei, si, ei) 

A 

Vsi, Cl, S2, 62, S3, 63 

Vi7 ( I [iT(si, 6i, S2, 62) A CaMseSs(s2, 62, S3, 63) 

-^(si, 61, S3, 63) ] 

D 

n{s, 6, s', e') 



(2.32) 



A Acceptable g{s') 



That is, an instance Ramify s{s,e, s' ,e') holds if and only if (s, 6 , s', o') be- 
longs to the transitive closure of CauseSg and if s' satisfies the underlying 
steady state constraints. Analogously, we define 



Ramify {s, e, s') = 

( \/si,ei,S 2 ,e 2 - Ramify g{si, 61 , 82 , 62 ) D il(si, ei, S 2 , 62 ) j 






Vsi, 61, S2, 62, S3, 63, S4, 64 

[iT(si, 6i, S2, 62 ) A Causes{s 2 , 62 , S3, 63) > 

A Ramify g{ss, 63, 84,64) D 71 ( 51 , 61 , 34 , 64 )] J 
D 



Be'. 77(s, 6 , s', e') 



(2.33) 



A Acceptable(s') 

That is, an instance Ramify{s, e, s') holds if and only if there exists some e' 
such that (s, e, s', e') belongs to the transitive closure of joining Ramify s to 
Causes, and if s' satisfies the underlying entire set of state constraints. Com- 
bining the axiomatization of action laws with this definition of ramification 
leads to the following characterization of successor states: 

Successor{s,a, s') = ro q/i'i 

3c, 6 , z [ Action{a, c, e)Acoz = sA Ramify {z o e, e, s') ] ' ’ 

To summarize, let T> he a, ramification domain, and let then iFCx> denote the 
union of EUNA with the definitions of 77oWs, axiom (2.20); Acceptable and 
Acceptabhs, axiom (2.23); Action, axiom (2.24); CauseSg and Causes, ax- 
ioms (2.29) and (2.30); Ramifps and Ramify, axioms (2.32) and (2.33); and 
Successor, axiom (2.34). The axioms iFCx> provide a correct axiomatization 
of transition in ramification domains, as the following theorem shows. 

Theorem 2.9.1. Let T> be a ramification domain and TCj) its encoding. 
Furthermore, let S and S' be two states and a an action. Then 

TCx> H Successor{s,a, s') 

iff there is a successor state S' of S and a such that EUNA |= s' = ts' , 
else 



TCd H ~'Successor{s,a, s ') 
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Proof. The claim follows from the previous propositions regarding correctness 
of the various axioms in and from the fact that the right hand side 

formulas of Equations (2.32) and (2.33) encode the appropriate transitive 
closures. Qed. 

Example 2.9.1. Let T> be the ramification domain modeling the electric cir- 
cuit consisting of two switches and a light bulb of Fig. 2.3 as in Example 2.2.2. 
Then TCd includes the axioms 

Action{a,c,e) = [ a = toggle(a;) A ( c = -'up(a;) A e = up(x) 

V c = up(cc) A e = -'up(a;) ) ] 

Acceptable g{s) = T 

Acceptable{s) = (i7oWs(light, s) = Holds{\xp{si),s) /\ Holds{\i.p{s 2 ),s)) 
Causals{£e,^g,s) = T 

Causal{£e,£g, s) = £s = 'ip(si) A £g = light A Holds (wp{s 2 ), s) 

M £^ = up(s 2 ) A £g = light A Holds{up{s-\^),s) 

V £s = -'up(s]^) A £g = -ilight 

V £s = -'up(s2) A £g = -ilight 

We have already seen that ECxi entails the following. 

[^up(sj^) o 0 = -iup(sj^) o up(s 2 ) o -ilight A s' = z o up(s]^) ] 

It also entails 

CaMses(up(s j^)oup(s 2 )o-ilight, up(s j^), up(s j^)oup(s 2 )olight, up(s j^)olight) 

The latter implies 

Ramify(up{si) o up(s 2 ) o -ilight, up(s;[), up(s;l^) o up(s 2 ) o light) 

according to axioms (2.32) and (2.33) and following the given definitions of 
Causes , Acceptablcs, and Acceptable. Consequently, 

5'MCcessor(-iup(s]^) o up(s 2 ) o -ilight, toggle(s]^), up(s]^) o up(s 2 ) o light) 

according to axiom (2.34). ■ 

2.9.3 Defining Transition Models 

The Fluent Calculus-based axiomatization lFCx> provides a formal account 
of the notion of state transition including ramifications. What remains to 
be done is to formalize the application of whole action sequences to an (un- 
specified) initial state in view of encoding observations. Our objective is to 
extend axioms IFCt) in such a way that there is a one-to-one correspondence 
between the standard, i.e., ‘classical’ models of the resulting axiomatization 
and the models of a set of observations in the sense of our action theory. 
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The core of this extension consists of two predicates Qualified{a*) and 
Result{a* , s) with the following intuitive meaning. A valid instance of the 
former indicates that the action sequence a* is executable starting in the 
initial state. A valid instance of the latter indicates that s represents the state 
which would result from performing a* in the initial state. As a notational 
convention, if a* is a (possibly empty) action sequence and a an action, 
then [a* I a] shall denote the action sequence which consists in a* followed 
by a. The crucial, domain-independent properties of the new predicates are 
determined by the following axioms: 

Qualified{[]) (2.35) 

Qualified{[a*\a]) = Qualified{a*) A 3s, s' Suca!s^or\^s, a, s^^ 

Result{[a*\a], s') D Vs [Result (a* , s) D Successor(s, a, s') ] (2.37) 

The reading of the topmost atomic formula, (2.35), is obvious. Formula (2.36) 
states, in words, that an action sequence [a*|a] is qualified if so is a* and 
if there exists a successor s' of s and a. Finally, implication (2.37) ensures 
that s' can only be the result of performing the sequence [a*|a] if s' is 
a possible successor when executing a in the state resulting from perform- 
ing a*. Notice, however, that the axioms (2.36) and (2.37) do not entail the 
existence of a resulting state whenever the corresponding action sequence is 
qualified, nor do they entail uniqueness of resulting states. We therefore need 
to add the following: 

3s. Result (a*, s) = Qualified{a*) (2.38) 

Result{a*,s) A Result {a* , s') D s = s' (2.39) 

Finally, the term intended to represent the initial state should qualify as 
such, namely, both in being a proper state term and in satisfying all state 
constraints, that is, 

Result{[],s) D State(s) A Acceptable{s) (2.40) 

In what follows, we prove that by introducing these domain-independent 
axioms we achieve what we have promised. Suppose given a ramification 
scenario {0,T>) with transition model A, and let tFCx> denote the encoding 
of domain V as described in the previous section. Let t be some classical 
interpretation of the set of formulas tFCxi U {(2.35)-(2.40)}, and let (A, Res) 
be an interpretation for (0,27). Then we say that l and (S,Res) eorrespond 
iff for all action sequences a*, states S, and collections of fluent literals s 
such that EUNA ^ s = rg we And that^*^ 

Below, by “ [P{ti, . . . , t„)Y is true” we mean that the n-tuple (t\, . . . , tY) is 
member of the relation which interpretation t assigns to predicate P, where 
ti (!<*<«,) denotes the element of l’s universe to which t maps term L. 
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Res{a*) = S iff [i?esMZi(a*, s)]Ms true (2-41) 

The notion of correspondence provides the basis for the correctness result of 
the entire axiomatization. 

Theorem 2.9.2. Let {0,T>) he a ramification scenario with transition model 
S, and let TCx> be the axiomatization of V. Then for each model l of 
TCd U {(2.35)-(2.40)} there exists a corresponding interpretation {S,Res) 
for {0,T>) and vice versa. 

Proof. 

a V . 

Let t be a model of .7 ^CdU{( 2.35)-(2. 40)}. We define a partial mapping Res 
from finite action sequences to states as follows: Res (a*) is defined when- 
ever [Qualified{a*)]'' is true; and in case it is defined, let s be a collection 
of fluent literals such that [Result {a* , s)Y is true, then Res{a*) = S for 
some S such that EUNA ^ s = rg. By induction on n, we prove that, 
for any action sequence a* = [oi, . . . ,a„], Res (a*) satisfies the condition of 
constituting an interpretation. By construction, the interpretation (Lf, Res) 
then corresponds to t. 

In the base case, n = 0, we have to show that i?es([]) is defined and 
satisfies the state constraints. According to axiom (2.35), [Qualified{[])]‘‘ 
is true; hence, i?es([]) is defined. Given that [Qualified{[])]'' is true, ax- 
ioms (2.38) and (2.39) imply that there is a unique (modulo ACIN) term s 
such that [i?esM^t([], s)]‘ is true. In conjunction with Proposition 2.9.2, for- 
mula (2.40) guarantees that s represents a state, which, moreover, satisfies 
the state constraints according to Proposition 2.9.3. 

For the induction step let n > 0 and suppose the claim holds for action 
sequence [oi, . . . ,a„_i]. Let a* = [oi, . . . , a„_i]. We have to show that 

a) Res{[a*\an\) is defined iff so is Res{a*) and E{Res{a*),an) is not empty, 
and 

b) i?es([a*|a„]) G E{Res{a*),an). 

From axiom (2.36) and the induction hypothesis for a* we conclude that 
[Qualified{[a*\an])\'' is true iff Res{a*) is defined and there exists a term s' 
such that [Successor{Tfif.s(^a*)j o-n, s')Y is true. This proves clause a) accord- 
ing to Theorem 2.9.1. Moreover, axioms (2.38) and (2.39) imply that if 
i?es([o*|a„]) is defined, then there is a unique (modulo ACIN) term s such 
that [Result{[a*\a„], s)Y is true. From axiom (2.37), the induction hypothe- 
sis for a*, and Theorem 2.9.1 it follows that s represents a successor state 
of Res (a*) and a„. This proves clause b). 

Let (S,Res) be an interpretation for (0, 25), and let t be an interpretation 
of tFC-D U {(2.35)-(2.40)| which satisfies the following: 



1. t is a model of tFCj). 
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2. For any action sequence a* and collection of fluent literals s, 

a) [Qualified{a*)Y is true if and only if Res{a*) is defined, and 

b) [Result {a* , s)Y is true if and only if both Res{a*) is deflned and 
EUNA h s = ruesia^)- 

We have to show that 6 is a model of lFCx)U{(2.35)-(2.40)}, in which case it 
corresponds to (27, Res) by construction. Given that i is a model of tFCj), 
it suffices to show that it is also a model of axioms (2.35)-(2.40). This in turn 
can be proved by induction on the length of the argument a* of Qualified 
and Result. This induction proof is completely analogous to the above. 

Qed. 



2.9.4 Encoding Observations 

The theorem at the end of the previous section shows the adequacy of the 
domain-independent axioms (2.35)-(2.40) as regards the formal notion of 
states resulting from performing action sequences. Next, and finally, we con- 
centrate on the scenario-specific expressions that are based on this concept, 
namely, the observations. Their axiomatization is straightforward. Any ob- 
servation F after [oi, . . . , a„] is encoded by the formula 

3s [ Result{[a\, . . . , a„], s) A Holds{F, s) ] (2.42) 

That is to say, the action sequence [ai , . . . , a„] must admit a resulting state 
in which, moreover, fluent formula F holds. The addition of these formulas 
automatically restricts the set of classical models to those which correspond 
to interpretations in which the respective observations are true. Let {0,T>) 
be a ramification scenario, then by we denote its axiomatization 

TC-d U {(2.35)-(2.40)} U {/\^gg,(2.42)}. Then classical entailment wrt. this 
axiomatization and the notion of entailment in our action theory coincide. 

Corollary 2.9.1. Let {0,T>) he a ramification scenario with axiomatiza- 
tion lFC(^o,v)- An observation F after [ai,...,a„] is entailed by {0,T>) iff 
l^C( 0 ,v) entails the formula 3s [Result{[ai, . . . , an], s) A Holds{F, s)]. 

Proof. Let t be a model of iFC(o,v) and (27, Res) a corresponding inter- 
pretation. Given Theorem 2.9.2, it suffices to show that i is a model of 
formula (2.42) iff the corresponding observation is true in (27, Res). By defi- 
nition, F after [oi o„] is true in {F,Res) iff Res{[ai, . . . ,an]) is defined 

and F is true in that state. This in turn is equivalent to t being model of 
3s[Result{[ai, . . . ,an],s) A Holds{F, s)] according to axiom (2.38), Propo- 
sition 2.9.3, and the fact that t and (27, Res) correspond. Qed. 

Example 2.9.2. Let T> be the ramification domain modeling the electric cir- 
cuit consisting of two switches and a light bulb of Fig. 2.3 as in Example 2.2.2. 
Furthermore, let TCx) be its axiomatization as explicated at the end of Sec- 
tion 2.9.2. Let O consist of the single observation 
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light after [toggle(s^)] 

then TCi^q xi) includes the axiom 

3s [ Result{[toggle{si)], s) A Holds{l±gh.t, s) ] 

Let tg be a term satisfying this conjunction, then axiom (2.37) implies 

Vs [ Result{[],s) D b'Mccessor(s, toggle(s]^), tg) ] 

In turn, this and the fact that Holds {light, tg) imply 

Vs [ i?esM^t([], s) D s = -'up(s]^) o up(s 2 ) o -ilight ] 

according to axiom (2.40) and TC(o,-d) - Following axioms (2.35) and (2.38), 
we thus conclude that 

3s [Result{[], s) A ~<Holds{wp{si), s) A Holds{wp{s 2 ), s) A Holds {light, s) ] 
In other words, {0,T>) entails the observation 

->up(s;l) A up(s 2 ) A -'light after [] 



2.10 Bibliographic Remarks 

The Ramification Problem was introduced and so named in [37].^^ This ar- 
ticle also contains the first proposal for a solution, which in essence is a 
specific realization of globally minimizing change. The authors themselves 
already argued that this approach may suggest unrealistic indirect effects, 
but no solution was offered. In [121] the idea was (informally) raised of in- 
troducing some notion of preference as regards changes of specific fluents 
over changes of other fluents. The first formal realization of this approach 
employed the concept of categorization [65]. The introduction of categories 
as means to avoid unmotivated indirect effects has later on been used in a 
variety of frameworks, accompanied by an inventiveness as regards names 
for these categories — e.g., frame vs. non-frame fluents [65, 57]; (the latter of 
which introduces ATZ as an extension of the Action Description Language A 
mentioned in Section 1.3); relevant vs. dependent [14]; persistent vs. non- 
persistent [18]; remanent vs. dependent [91]; inertial vs. non-inertial [41]; or 
persistent, remanent and contingent fluents [17]. While all being based on the 
principle of categorization, these approaches differ in the degree of sophisti- 
cation. The most elementary use of categories is to distinguish two classes of 

First published as [36]. The naming was inspired by [28], which was devoted to 
the problem of how to exploit logical consequences (so-called “ramifications” 
{sic)) of goal specifications in planning problems, with the aim of restricting 
search space. 
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fluents, one of which is always subject to the assumption of persistence while 
the other fluents may vary freely (in view of satisfying the state constraints). 
More elaborated methods define a partial preference ordering among all pos- 
sible changes — as we did in Section 2.3. Even further refinement is obtained 
by state-dependent categorization.^^ The report [92] presented an extension 
of “Features-and-Fluents” (c.f. Section 1.3) to provide a formal methodol- 
ogy for assessing the range of applicability of different approaches based on 
categorization. 

An early approach which exploits a specific notion of causality to address 
the Ramification Problem was proposed in [26]. This formalism supports 
indirect effects deriving from complete descriptions of how the truth-value 
of a particular fluent might be caused to change. As an example, recall the 
electric circuit consisting of two switches and a light bulb of Fig. 2.3. The 
approach of [26] would encode the various possibilities to affect the state of 
the bulb by these axioms: 

C'oMses(a, s, light) = CaMses(a, s, up(s;[)) A Holds ( vlp{s 2 ), s) 

V Cartses (a, s, up(s 2 )) A Holds(up{si), s) 

C ancels {a, s, light) = Cancels{a, s,up{si)) V C'anceZs(a, s, up(s 2 )) 

(2.43) 

where Causes{a, s, f) should be read as “executing action a in situation s 
causes fluent / to become true,” Cancels{a, s, f) as “executing action a in 
situation s causes fluent / to become false,” and Holds{f,s) as “fluent / 
is true in situation s.” Suppose we are given a specification of how up(sj^) 
may become true (resp. false), namely, 

CaMses(a, s, up(s j^)) = a = toggle(s;L) A -'i7oWs(up(s j^), s) 
(707106^(0, s, up(sj^)) = a = toggle(s^) A Holds{wp{s-^), s) 

plus this general axiom meant to express persistence of all non-affected flu- 
ents: 

Holds{f,Do{a,s)) = , . 

Causes{a, s, f) V { Holds{f, s) A ~'Cancels{a, s, f)) ' ’ 

where Do{a,s) denotes the situation obtained by executing action a in situ- 
ation s. One then obtains, e.g., that -< Holds (wp{si), So) AHolds{Mp{s 2 ) , Sq)A 
-<Holds{l±ght, Sq) implies C'aMses(toggle(s;[), S'o, up(s]^)) and, hence, also 
C'oMses(toggle(s;L), ^o, light). Thus Holds{l±ght, Do{toggle{s-i), Sq)), as 
intended. No effort has to be made to suppress an unwanted change of up(s 2 ) 
since no causal relation like the ones in (2.43) exists that may support this. 
On the other hand, the use of definitional descriptions of causal dependencies, 
as in (2.43), is restricted to domains where these dependencies are acyclic. 

We note, however, that this refinement wonld not help with our connter- 

example involving a relay introdnced at the end of Section 2.3. 
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i.e., hierarchical. Otherwise, that is, if fluents depend mutually, unmotivated 
changes cannot be precluded. To see why, consider the elementary cyclic 
specification 



C'oMses(a, s, up(s = OoMses(a, s, up(s 2 )) 

Cancels(a, s,up(si)) = Cancels(a, s,up(s 2 )) 

modeling the two switches connected by a tight spring of Fig. 2.5. Suppose 
both up(s;l^) and up(s 2 ) be true in initial situation Sg, and let A be 
some action whose execution does not affect up(s]^) nor up(s 2 ). Then the 
two formulas (2.45) in conjunction with the persistence axiom (2.44) are 
too weak to conclude that up(s;]^) and up(s 2 ) remain true. For neither 
-'C'onceZs( 2 l, ^o, up(s j^)) nor -iC'oncefe(7l, S'o, up(s 2 )) is entailed. 

Cyclic causal dependencies are no obstacle to the approach developed 
in [14]. This method is based on so-called “causal implications,” which, by 
virtue of being directed, cannot be applied in a non-causal way. For instance, 
the causal implication up(s;l^)Aup(s 2 ) light has a different meaning than, 
say, up(s]^) A -ilight ->up(s 2 ) . In the original approach, successor states 
are obtained by minimizing change while respecting domain-specific causal 
implications. A far more simple definition of successor states based on causal 
implications was presented in [73]. As opposed to [14], this subsequent method 
moreover allows for the distinction between implicit qualifications and rami- 
fications deriving from state constraints (recall Section 2.8), the necessity of 
which was first pointed out by [38]. The reason is that in the approach [14] 
one always strives for a successor state no matter how many changes are nec- 
essary to this end, while [73] additionally requires all changes be explicitly 
grounded on some causal implication. A more detailed comparison between 
the two approaches can be found in [73]. A closely related approach, [30, 31], 
is based on a nonmonotonic theory of “conditional entailment” and uses ex- 
pressions similar to causal implications. 

The nature of causal implications resembles the concept of causal rela- 
tionships propagated in this book. In fact, our flxpoint characterization of 
causal minimizing-change successors was borrowed from [73] and transferred 
to causal relationships. A crucial difference between causal implications and 
relationships is that only the latter distinguishes between a context an ex- 
plicitly occurred effect. E.g., the two relationships £i causes £ if £2 and 
£2 causes £ if £± are not interchangeable while both corresponding to the 
identical causal implication, viz. £\ A £ 2 ^ £■ The following simple scenario 
illustrates the usefulness of causal relationships being more expressive in this 
sense. 

Example 2.10.1. Consider a more subtle, ancient method to hunt turkeys, 
namely, by using a (manually activated) trapdoor. The state of this trapdoor 
is formalized by the fluent name trap-open*^. The fluent at-trap(turkey) 
describes whether the victim is in the dangerous zone or not, and the fluent 
name alive^ is used as before. The ground underneath the trapdoor is 
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designed such that nothing being at-trap with the trapdoor open can be 
alive. This is represented by the state constraint 

Va; [ at-trap(x) A trap-open D -'alive(a;)] (2-46) 

We can open the trapdoor and entice the turkey, respectively, via these two 
action laws: 



open transforms {-■trap-open} into {trap-open} 

entice(x) transforms {^at-trap(a;)} into {at-trap(a;)} 

While the state of the trapdoor can possibly affect the turkey being alive, 
the animal is alert to the extent that it would never kill itself by moving 
towards the open trapdoor; hence, trap-open may influence alive(turkey) 
but at-trap(turkey) may not so. The latter is intended to give rise to the 
implicit qualification -itrap-open for the action entice(turkey) . Therefore 
the correct influence information is I = {(trap-open, alive(turkey))}. In 
conjunction with our state constraint (2.46), this determines a single causal 
relationship, viz. 

trap-open causes ->alive(turkey) if at-trap(turkey) (2.47) 

Given the state S = {alive(turkey), at-trap(turkey), ->open} (say, af- 
ter having enticed the turkey), performing the action open yields the state 
{alive(turkey), at-trap(turkey), open} as preliminary successor, which 
violates the state constraint. Since trap-open occurred as effect, the given 
causal relationship is applicable, which results in the unique and expected 
successor state {-■alive(turkey), at -trap(turkey), trap-open}. 

Consider state 5”= {alive(turkey),-iat-trap(turkey), trap-open} and 
action entice(turkey), on the other hand. Performing this action yields the 
preliminary successor {alive(turkey), at -trap(turkey), trap-open}, too, 
but now obtained through the direct effect E = {at-trap(turkey)}. There- 
fore the only available causal relationship is not applicable; hence, no suc- 
cessor state exists. In other words, -itrap-open is revealed as additional, 
implicit qualification for entice(turkey), as expected. Notice that we are 
only able to distinguish between these two cases by employing causal rela- 
tionship (2.47) but not the analogue, namely, 

at-trap(turkey) causes -lalive (turkey) if trap-open 

Since these two relationships both correspond to the same causal implication 
at-trap(turkey) A trap-open ^ ->alive(turkey), this distinction goes be- 
yond the expressiveness of causal implications. ■ 

In [69] , which improves previous work where indirect effects are ‘compiled’ 
into action descriptions [68], first-order formulas resembling causal relation- 
ships are employed to define dependencies between effects and their indirect 
consequences. These formulas are of the form 
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<?(s) A Caused{fi,Vi, s) A ... A Caused{fn,Vn, s) D Caused{f,v,s) (2.48) 

where Caused{f,v,s) should be read as “fluent / is caused to take on truth- 
value V in situation s,” and where ^{s) describes properties of situation s. 
Notice the distinction between a context, i.e., d>, and explicitly occurring 
effects /i, . . . , E.g., a formalization of the electric circuit involving two 

switches and a light bulb of Fig. 2.3 would include the specification 

Holds {\rp{s 2 ), s) A CaMse(i(up(s;[), Trucks) D CaMse(i(light, True,s) (2.49) 

along with the action deflnition^^ 

-< Holds { vlp{s i), s) D C'oMsed(up(s;l^), Trae, i9o(toggle(s;l^), s)) (2.50) 

The general axiom of persistence used in this context is 

-'Bv. Caused{f,v, Do{a, s)) D { Holds{f, Do{a, s)) = Holds{f, s)) (2.51) 

This axiom is of course useless unless an instance Caused{F,V, Do{A, S)) 
is provably false whenever it does not follow from the domain-specific ax- 
ioms both for direct effects of actions (like in (2.50)) and for causal de- 
pendencies (like in (2.49)). This is formally achieved by minimizing the 
set of true instances of Caused via so-called circumscription [77]. For 
instance, suppose a situation Sq be specified by ->Holds(wp{si), So) A 
Holds (up{s 2 ), So) A ~'Holds{light, So) , and let Si = Ho(toggle(s;L), ^o) . 
Then the axioms (2.49) and (2.50) entail Caused{np{si), True, Si) , hence 
Caused {light, Ttmc, S'!). They do not entail 3v. CaMse(i(up(s 2 ), u. S'!). Mini- 
mizing Caused therefore allows to conclude -'3w. CaMse(i(up(s 2 ), w, S'!) and, 
hence, Holds{'o.p{s 2 ), Si) according to axiom (2.51). 

An alternative causality-based method to address the Ramification Prob- 
lem, which also includes the distinction between context and triggering effect, 
is [47]. 

The causality-oriented approaches cited so far all intrinsically follow the 
policy of minimizing change. This amounts to rejecting any potential suc- 
cessor state whose distance to the original state is strictly greater than the 
distance of another proper successor state. With our electric circuit involving 
the light detecting device (Example 2.6.3) we have seen that the applica- 
bility of this paradigm is limited. Causal relationships and their successive 
application to preliminary successor states constitute the first approach that 
does not follow the principle of minimizing change [113];^^ the notion of 
steady state constraints has been introduced in [116]. In [93], an extension 
to “Features-and-Fluents” was developed as an alternative to the aforemen- 
tioned [92] for categorization-based approaches. The newly proposed action 

For the sake of simplicity we neglect the concept of action preconditions here. 
First published as [109]. The approach [71], which also supports non-minimal 
successor states, emerged at the same time but proved erroneous in producing 
lots of ‘magic’ indirect effects. 
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theory is causality-oriented and follows the idea of successively generating 
indirect effects, starting with preliminary successors and continuing until an 
acceptable state is obtained. Minimality of changes is not required. This 
general approach was used in [48] to extend the so-called Temporal Action 
Logic [23]. The approach [10] builds on the action calculus called Linear 
Connection Method (see below) . It introduces resource-sensitive implications 
which are syntactically identical to the action descriptions used therein. These 
additional implications are applied with higher priority and have certain flu- 
ent literals marked which must have previously occurred as effects in order for 
the implication to apply. The resulting calculus can be viewed as realization 
of causal relationships as means to address the Ramification Problem. 

In [21], a least fixpoint semantics is used to model chains of indirect effects. 
The article also contains the proposal to extend the Ramification Problem 
to indirect effects which derive from constraints that connect the states in 
different situations. 

An approach which is considerably different from all methods discussed 
so far yet still related is based on networks representing probabilistic causal 
theories [81]. These networks describe, in the first place, static dependencies 
among their components. As argued in [82, 83], however, the truth- values 
of one or more nodes may be reset dynamically and, then, the values of all 
depending nodes need to be adjusted according to standard (Bayesian) rules 
of probability. This can be regarded as generating indirect effects. If prob- 
ability values are restricted to the binary 0/ 1-case, then a network whose 
nodes are fluent names resembles our concept of influence information. For 
instance. Fig. 2.13a) depicts a network suitable for our electric circuit with 
the relay and light detector. In view of a general solution to the Ramifi- 
cation Problem, some restrictions of causal networks are worth mentioning. 
First, we note that the resulting value of a node, after having fixed the di- 
rect effects, must not be computed until all new values of its predecessors 
have been determined. Consequently, the proposition detect in the net- 
work of Fig. 2.13a) necessarily remains false after toggling switch in the 
state depicted in Fig. 2.8; hence, the non-minimal successor state where a 
light flash has been detected cannot be obtained. Second, recall the domain 
with the trapdoor. Example 2.10.1. Since fluent trap-open changing pos- 
sibly affects alive(turkey), depending on at-trap(turkey), the adequate 
network is the one depicted in Fig. 2.13b). This, however, does not allow to 
distinguish between the two situations where either trap-open becomes true 
with at-trap(turkey) being true, or it happens to be the other way round. 
Hence, the distinction between context and triggering effect is not supported 
by causal networks. Finally, networks representing causal theories are based 
on acyclic graphs, which means that cyclic dependencies, like the one given 
by our switches of Fig. 2.5, which are being connected by a spring, cannot be 
represented (c.f. Fig. 2.13c)). 

The Fluent Calculus paradigm we used for an axiomatization of our ac- 
tion theory emerged from the urge to solve the fundamental so-called Frame 
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a) c) 



Figure 2.13. Graphs representing the structural dependencies between fluents re- 
garding a) Example 2.6.3, b) Example 2.10.1 (where t abbreviates turkey), and 
c) state constraint up(s]^) = up(s 2 ) (inducing a cycle), respectively. 

Problem. Introduced in the context of Situation Calculus [74], it concerns 
the at first glance trivial but in fact highly problematic challenge to specify, 
in logic, that non-affected fluents keep their truth-values during the perfor- 
mance of actions. To see where the difficulties lie, observe first that naively 
describing the effects of actions such as toggle(a;) by means of classical 
material implication does not work. E.g., the specification 

Ho(toggle(x)) D (^up(x) D up(x)) (2.52) 

is inconsistent with, say, -'up(s;[) Ai7o(toggle(s]^)), i.e., the seemingly natu- 
ral logical formalization of a situation where switch is down and is about 
to being toggled. This is why Situation Calculus introduces an additional so- 
called situation argument to each action and fluent, restricting its scope to a 
particular one out of many possible situations. The effect description (2.52) 
thus becomes 

->up(a:, s) D up(a;, Do(toggle(a:), s)) 

where s denotes some abstract situation and i7o(toggle(a;), s) the succes- 
sor situation resulting from performing toggle(ai) in s. This representation 
technique, however, raises the problem of how to conclude that some other flu- 
ent which holds in s, say up(s 2 ), still holds in situation i7o(toggle(s]^), s). 
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In order that this conclusion is granted, an additional so-called frame axiom 
is required, namely, 

X y A np{y,s) D up(y, Do(toggle(a;), s)) 

Now, the Frame Problem concerns the need for a large number of these 
frame axioms (the representational aspect of the problem) and the necessity 
to carry, one-by-one, each unchanged property to the next situation (the 
inferential aspect). 

It took more than two decades to solve the representational aspect to the 
best possible extent. The approach [86] is based on pure classical logic and 
completely avoids the specification of frame ax;ioms. This is accomplished by 
combining, separately for each fluent, in a single effect ax;iom all possibilities 
of how the fluent may change to true and to false, respectively. By virtue of 
being bi-conditionals, these so-called successor state axioms implicitly contain 
sufficient information to also entail any non-change of the fluent in question. 
An example is 

up(a;, F)o(a, s)) = o = toggle(a:) A ->up(x,s) 

V-'[up(x, s) D a = toggle(x)j 

Given ->up(sj^, S'o)Aup(s 2 , S'o), this axiom entails up(sj^, £)o(toggle(s;L), S'o)) 
and also up(s 2 , £^o(toggle(s;[), S'o))) tacitly assuming that toggle(s]^) yf 
toggle(s 2 ). While the concept of successor state axioms perfectly solves the 
representational aspect of the Frame Problem, it does not at all address the 
inferential aspect. For it still requires, for each non-affected fluent, separate 
application of one of these ax;ioms in order to conclude that the fluent keeps 
its truth-value in the resulting situation. 

The Strips framework [27] was an early development in response to the 
inferential challenge raised by the Frame Problem. Strips encodes states 
as sets of fluents, and the performance of actions is specified operationally, 
namely, by removal and addition of certain fluents to these sets. Apparently, 
this avoids investigation of any non-affected fluent. In compensation, the 
operational, non-declarative nature of this approach causes the loss of both 
expressiveness and flexibility of logic. With the aim of regaining the latter 
without losing the computational merits of Strips, the Linear Connection 
Method has been developed in [9], a precursor of Linear Logic [39]. It employs 
a non-classical, resource-sensitive implication that allows to specify effects of 
actions as straightforward as in 

->up(a:) & 5'z<(s) -<> up(x) & 5'zt(F)o(toggle(x), s)) (2.53) 

where “ & ” denotes a non-idempotent conjunction, “ — o ” the aforemen- 
tioned, non-classical implication, and Sit{s) should be read as “the cur- 
rent situation is s.” Being resource-sensitive, the antecedent of an impli- 
cation like (2.53) is ‘consumed’ whenever deriving the consequent. Conse- 
quently, ->up(s;[) &up(s 2 ) & Sit{So), say, in conjunction with formula (2.53) 
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entails up(s^) &up(s 2 ) & Sit{Do{toggle{s-^), Sg)) but not the contradiction 
-iup(s;l^) & up(s 2 ) & SU{Sq) & up(sj^) & Sit{Do{toggle{s-^), 5'o)). A compari- 
son between Linear Connection Method and Situation Calculus can be found 
in [8]. A thorough study of the relationship with Linear Logic is reported 
in [29], where also an inference engine for the Linear Connection Method is 
described. 

The Fluent Calculus paradigm [51], so named by [12], embeds in pure clas- 
sical logic the notion of resource-sensitivity as means to reflect the dynamics 
of state transitions. For a specific class of planning problems, axiomatizations 
based on Linear Connection Method, Fluent Calculus, and Linear Logic (in 
the variant of [72]) have been proved equivalent [45]. In [118] it is shown 
that under the provision that actions do not have an unbounded number of 
direct effects, any Situation Calculus specification along the line of [86] can 
be transformed into an essentially equivalent Fluent Calculus specification, 
in which at the same time the representational and the inferential aspect of 
the Frame Problem are addressed. Axiomatizations using the paradigm of 
Fluent Calculus have been developed for a variety of ontological aspects — we 
just mention concurrency of actions [12], continuous change [50, 117], hier- 
archical planning [25], and nondeterministic [119] and complex actions [52]. 
Fluent Calculus grounds on the algebraic theory of Abelian monoid, which 
is given by the axioms of associativity, commutativity, and existence of a 
unit element. Unification algorithms specialized in this equational theory are 
described in, e.g., [105, 16, 44] (for an overview see [80]). An analysis of com- 
plexity is performed in [56]. The related notion of unification completeness 
has been developed in the context of logic programming. Introductions to this 
principle can be found in, e.g., [54, 101, 111]. The existence of a unification 
complete theory for ACl has been shown in [53]. 
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3.1 Abnormal Action Disqualifications 

Suppose you want to start your car to take a ride. What are the preconditions 
for doing this? Presumably you first want to make sure that you got hold of 
the ignition key. Does this justify the conclusion that the action can now 
be successfully performed? Not if you are overly cautious, in which case you 
might also want to check that the gas tank is not empty. Still, however, there 
are numerous other imaginable causes for being unable to ignite the engine of 
your car. For instance, you have failed to make sure that no potato clogs the 
tail pipe, despite the fact that this necessarily renders the intended action 
impossible. 

Of course there are good reasons for ignoring the possibility of the tail pipe 
housing a potato. It seems highly inappropriate, in general, to explicitly verify 
every imaginable precondition of an action — in fact this is even impossible: 
Apart from the fact that besides a clear tail pipe there are lots of further 
disqualifying obstacles to our example action, how would you ensure that 
after checking the tail pipe it does not become clogged during you walking 
to the front door and taking a seat, prior to trying to ignite the engine? 
Actors in real world environments would therefore be totally paralyzed would 
they never perform an action without worrying about every however unlikely 
obstacle. 

The vast majority of preconditions for actions in daily life are so likely to 
being satisfied that common sense assumes them away as long as there is no 
evidence to the contrary. On the other hand, ignoring all these ‘abnormal’ 
action disqualifications prima facie also means to being able to handle situa- 
tions where the prior assumption of executability turns out wrong. This is in 
contrast to the idealistic view our action theory takes so far. The existence 
of a successor state according to the underlying transition model is supposed 
to guarantee the successful performance of the action in question. Making 
the contrary observation is considered impossible insofar as it would render 
inconsistent the entire scenario. Let, for example, the state of a car’s engine 
be denoted by the fluent runs and suppose the action of starting the car be 
specified by the action law ignite transforms {-runs} into {runs}. Then 
the two observations -iruns after [] and ->runs after [ignite] constitute a 
scenario that admits no models at all. 
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Despite being a necessary precondition, the absence of obstacles like 
“there is a potato in the tail pipe” should, however, not be added to the 
condition part of the above action law. For otherwise this would require what 
has been said inappropriate, namely, the inspection of all abnormal disquali- 
fications of this action prior to assuming it executable. The general challenge, 
therefore, is to weaken the strict conclusion that actions are guaranteed to 
producing the expected effect once all specified preconditions are satisfied. 
This conclusion should become an assumption by default. As such it is to 
be made as long as there is no information to the contrary. Developing a 
formal account of this concept within the framework of an action theory is 
the Qualification Problem. Solving it is necessary in view of applying action 
theories to real-world environments, which do not conform with the idealis- 
tic view in that most if not all actions are potentially subject to abnormal 
disqualification . 

Let us state the problem more precisely. Suppose given the two fluents 
runs and in(pt) stating, respectively, whether the engine of the car is run- 
ning and whether its tail pipe houses a potato. As above, the action ignite 
shall be specified by the action law ignite transforms {-runs} into {runs}. 
Consider the situation where the only available information concerns the state 
of the engine, which is known to be off. Nothing is known as to whether or 
not the tail pipe is clogged. The formal specification of this scenario thus 
consists of the observation 



-•runs after [ ] 

Then by default it should follow that runs after [ignite]. The argument 
supporting this conclusion would be that there is no hint at a potato being 
placed in the tail pipe and the only ‘regular’ precondition of the action, viz. 
that the engine is not running already, is true in the initial state. ^ Our action 
is therefore to be assumed executable and so to have the expected effect. But 
now consider the situation where we do not only know the engine is off but 
also that there is a potato in the tail pipe, that is, 

->runs after [] 
in(pt) after [] 

Then it should no longer follow that igniting the engine is possible nor, hence, 
that the observation runs after [ignite] holds. 

This example reveals a crucial principle with which the Qualification 
Problem is inherently connected. Namely, additional observations, in our case 
in(pt) after [], may force the withdrawal of previously valid conclusions. A 
so-called nonmonotonic entailment relation for observations is required to 
this end. This is in contrast to our current action theory being monotonic in 
that sense: Whenever two sets of observations Oi and O2 satisfy Oi QO2, 



^ By “regular” preconditions we generally mean both the conditions occurring in 
an action law and the implicit qualifications deriving from state constraints. 
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then any observation entailed by Oi is also entailed by 02- This is a conse- 
quence of the fact that adding observations only reduces the set of models. 
A fundamental task when addressing the Qualification Problem is therefore 
to modify an underlying monotonic entailment relation so as to become a 
nonmonotonic one. 



3.2 Minimizing Abnormal Disqnalifications 

Recall from above the scenario where the only observation concerns the en- 
gine, stating that it is not running. No information is given as to whether the 
tail pipe is clogged or not. According to our action theory, the set of models 
of this scenario divides into two classes, one of which consists in all models 
which consider initially false fluent in(pt), while the models of the other 
class consider that fluent true. Suppose somewhere in the underlying domain 
specification it is said that in(pt) being true renders action ignite unex- 
ecutable even if all regular preconditions are satisfied. Then half the models 
entail this very conclusion, i.e., that the action cannot be performed, the 
others do not so. Hence, if all models need to be taken into account when de- 
ciding on entailment, then nothing follows from the given observations as to 
the action in question being executable. As argued in the introduction to the 
Qualification Problem, however, it is reasonable to conclude, by default, that 
starting the engine is possible. Granting this conclusion therefore requires to 
ignore all those models which claim the contrary. 

Dubious and unsound as it may seem to simply disregard ‘inconvenient’ 
models preventing the desired conclusion there is good reason for so doing in 
this particular situation. It has been said that an abnormal disqualification 
should be assumed away by default, that is, as long as this assumption is 
consistent with the available information. From the model perspective, this 
amounts to disregarding all models which entail an abnormal action disqual- 
ification if only at least one model exists of the scenario at hand where this 
abnormality does not hold. One way of achieving this formally is to introduce 
a so-called preference criterion. Being a partial order on models, this criterion 
will allow us to select the most preferred ones and, then, to confine ourselves 
to those when talking about entailment. 

The concept of model preference is inherently nonmonotonic, just as the 
Qualification Problem requires it. Additional observations may invalidate 
some of the models, among which, moreover, might be all previously pre- 
ferred ones. As a consequence, models that have been disregarded earlier 
may now come to light and give rise to a very different set of entailed formu- 
las. Recall our example where we add to -runs after [] the observation that 
there is a potato in the tail pipe initially, i.e., in(pt) after [] . This new infor- 
mation falsifies every model that claims the possibility to ignite the engine. 
The remaining, previously unpreferred models all suggest an abnormal dis- 
qualification of the action, hence do now entail ignite inexecutable after [], 
as we will note this kind of observation: 
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Definition 3.2.1. Let £ , T , and A he sets of entities, fluent names, and 
action names, respectively. A disqualification observation is of the form 

a inexecutable after 

where each of a,ai, . . . ,a„ is an action (n>Q). If Res is a partial map- 
ping from finite (possibly empty) action sequences to states, then this dis- 
qualification observation is true in Res iff Res{[ai, . . . ,an]) is defined but 
Res{[ai, . . . ,an,a\) is not. ■ 

In what follows, the term “observation” refers both to observations in the 
original sense as introduced in Chapter 1 (c.f. Definition 1.2.5) and to dis- 
qualification observations. 

For a formal account of the approach to the Qualification Problem 
sketched above we need means to connect abnormal disqualifications of ac- 
tions with the situations that give rise to them. The following notion serves 
this purpose. 

Definition 3.2.2. Let £ , T , and A he sets of entities, fluent names, and 
action names, respectively. A disqualifying condition is an expression of the 
form F D disq{a) where F is a fluent formula and a an action. ■ 

For notational convenience, both F and a may contain variables, in which 
case the disqualifying condition F D disq{a) is regarded as representative 
for all its ground instances. An example is in(a:) D dzsg(ignite), stating 
that any object clogging the tail pipe unqualifies the action of starting the 
engine. 

A disqualifying condition F Z) disq(a) indicates that whenever formula F 
is true, then action a cannot be performed even if all of its regular precondi- 
tions are satisfied. Having disqualifying conditions requires an extended no- 
tion of interpretations and models (E,Res) of action scenarios. If an action a 
is disqualified in a particular state i?es([oi, . . . ,an]), then this dictates that 
Res{[ai , . . . ,a„,a]) is undefined regardless of whether the underlying tran- 
sition model, E, suggests a successor state of i?es([oi, . . . , a„]) and a. To 
allow for comparison of models in view of preferring those with the fewest pos- 
sible abnormal disqualifications, a third component is introduced into both 
interpretations and models. This new argument, denoted Ab, reflects all situ- 
ations where an action cannot be performed on account of some disqualifying 
condition. 

Definition 3.2.3. Let {0,T>) he a ramification scenario, and let Q he a set 
of disqualifying conditions. An interpretation with abnormalities for {0,T>) 
is a triple {E, Res, Ah) where E is the transition model of T>, Res is a 
partial function which maps finite (possibly empty) action sequences to ac- 
ceptable states, and Ab is a set of non-empty action sequences such that the 
following holds: 

1. i?es([]) is defined. 

2. For any sequence a* = [oi, . . . , ak-i,ak] of actions (k > 0 ): 
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a) Res{a*) is defined if and only if i?es([ai, . . . , afe_i]) is defined; 
S{Res{[ai, . . . ,ak-i]),ak) is not empty; and no F D disq{ak) € Q 
exists such that F is true in Res{[a\, . . . ,afc_i]). 

b) Res{a*) G S{Res{[ai,. . . ,ak-i]),ak). 

c) a* G Ab if and only if Res{[a\, . . . ,ak-i\) is defined and there is 

some F D disq{ak) such that F is true in i?es([ai, . . . , Ofc-i]) . ■ 

Put in words, whenever Res{[a\, . . . ,afc_i]) entails the antecedent of a dis- 
qualifying condition for action Ofe, then Res{[ai, . . . , afc]) is not defined and 
Ab includes the sequence [oi, . . . , a^] . 

The appropriate preference criterion and entailment relation can then be 
defined straightforwardly on the basis of comparing the additional compo- 
nents, Ab. Informally speaking, the less action sequences are declared dis- 
qualified by a model the less abnormal the latter. Entailment is then decided 
on the basis of least abnormal models, which thus are the preferred ones. 

Definition 3.2.4. Let {0,T>) be a ramification scenario with transition 
model E and Q a set of disqualifying conditions. If I = (E,Res,Ab) and 
/' = {E , Res' , Ab') are interpretations with abnormalities for {0,T>), then I 
is less abnormal than I' , written I < I' , iff Ab ^ Ab' . A model with abnor- 
malities of {0,T>) is an interpretation (E,Res,Ab) such that each o € O 
is true in Res . A model is preferred iff there is no other model which is less 
abnormal. An observation o is entailed iff it is true in all preferred models 
with abnormalities of {0,T>). ■ 

Let us see how this account of abnormal action disqualifications solves 
our initial example. 

Example 3.2.1. Let T> be the ramification domain consisting of entity pt, 
fluent names runs° and in^, and action name ignite accompanied by 
the action law ignite transforms {-runs} into {runs}. Furthermore, let 
in(x) D dzsg(ignite) be a disqualifying condition. Suppose E be the tran- 
sition model of T>, and let 0\ consist of the single observation 

-runs after [] 

Two models Mi = {E,Resi,Abi) and M 2 = {E, Res 2 , Ab 2 ) exist for 
the scenario (Oi,T>), namely, where i?esi([]) = {-runs, ->in(pt)| and 
i?es 2 ([]) = (“Tuns, in(pt)}. Since the antecedent of the instance {x 1 — pt} 
of the underlying disqualifying condition, in(a;) D dzsg( ignite), is true in 
^es 2 ([]), we have A &2 = |[igiiit;e]}. In contrast, the first model entails no 
abnormal disqualification, that is, Ab\ = {}. Hence, M\ -< M 2 . Model Mi 
being the unique preferred one, we see that our scenario entails the observa- 
tion runs after [ignite] . 

Let, on the other hand, O 2 consist of the two observations 



^runs after [] 
in(pt) after [] 
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then the only model for { 02 ,T>) is M 2 from above, which thus is preferred. 
The state i?es 2 ([ignite]) is not defined due to our disqualifying condition. 
Hence, {02,T>) entails the observation ignite inexecutable after []. ■ 

This example demonstrates that our account of the Qualification Problem 
meets the basic requirement: Abnormal disqualifications may be assumed 
away by default, and in fact are so whenever this offers a coherent account of 
the observations. Consequently, the new entailment relation is nonmonotonic. 

Our way of assuming away unlikely action disqualifications for the gener- 
ation of preferred models can be characterized as globally minimizing abnor- 
malities. No preference is given in case an action scenario implies that there 
be one out of two alternative abnormalities without necessitating a partic- 
ular one. Suppose, for instance, observation suggests that there either be a 
potato in the tail pipe or a used chewing gum in the car’s door lock thus 
rendering unexecutable the action of unlocking the door. This gives rise to 
two preferred models, one of which accounts for the first and the other for 
the second abnormal disqualification. The two models are considered equally 
plausible, which raises the question whether so doing might not be overly 
credulous. If, say, the neighborhood where we have left the car unattended 
for some time suffers from frequent strikes of a tail pipe marauder, then like- 
lihood might favor the conclusion that we will be unable to start the engine 
while succeeding in unlocking the door. 

One solution to this problem is the explicit introduction of probability val- 
ues for each potential abnormal disqualification. In most real-world domains, 
however, it seems difficult if not impossible to acquire knowledge this pre- 
cise. The Qualification Problem is therefore concerned with qualitative rather 
than quantitative reasoning about unlikely disqualifications of actions.^ Ac- 
cordingly, we will later (in Section 3.5) provide means to distinguish different 
degrees of (im-)probability. This will enable us to prefer minimization of ab- 
normalities that are a priori more unlikely to occur than others. 

The question remains whether global minimization of abnormal action 
disqualifications risks to ignore other fundamental reasons for preferring an 
abnormality over another one — that is, reasons which are not grounded on 
a priori differences in likelihood. Within an isolated state, preference can 
also be caused by observations entailing a certain abnormality. Our approach 
accounts for this via the notion of disqualifying conditions. On the other 
hand, the various states that are passed during the evolution of a system are 
not at all isolated. Consequently, preference of some abnormality over another 
one may also be caused by the dynamics of action theories. That is to say, 
a particular abnormal disqualification might naturally come about by state 
transition, in which case causality may provide good reasons for preferring 
this very abnormality over alternatives. The following simple extension of 
our introductory example illustrates this point and shows that our account 



^ More to nonmonotonic vs. probabilistic reasoning in the brief historical account 
in Section 3.7. 
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Figure 3.1. In general, we would consider it abnormal if we failed to start our 
car. But would we still do so if we deliberately insert a potato into the tail pipe 
beforehand? 

of the Qualification Problem so far might fail to recognize intuitively expected 
preferences among abnormalities. 

Example 3.2.2. Let V be the ramification domain of Example 3.2.1 but in- 
cluding the fluent name heavy^, denoting whether its argument is a heavy 
object, and the action name insert^, denoting the introduction of an object 
into the tail pipe. The new action is accompanied by the law 

insert(x) transforms {->in(a;)} into {in(a;)} 

The action is, however, abnormally disqualified if the object to be inserted is 
too heavy, which is accounted for by the disqualifying condition heavy(a:) D 
dzsg(insert(a;)) in addition to in(a;) D dzs(j'(ignite). 

Suppose it is known that the engine is not running and the tail pipe is not 
housing a potato. What, then, would be the expected outcome of first trying 
to insert a potato and, afterwards, trying to start the engine (c.f. Fig. 3.1)? 
Since there is no reason to believe that the potato is too heavy, the first action 
should be assumed qualified by default. The tail pipe thus becoming clogged 
disqualifies the second action. While this is the expected conclusion, our for- 
mal approach suggests equally plausible a second course of events. To see why, 
let O consist of the single observation -runs A ^in(pt) after []. Suppose S 
is the transition model of 2?, and let Res\ and Res 2 be two partial mappings 
from action sequences to states designed as shown in Fig. 3.2. Then both in- 
terpretations Ml = (S, Resi, Abi) with Abi = {[insert(pt), ignite]} and 
M 2 = {S, Res 2 ,Ab 2 ) with A &2 = {[insert(pt)]}) are models of {0,T>). 
The first model. Mi , suggests that inserting the potato is successful and, 
hence, renders it impossible to start the engine afterwards — ^just as expected. 
Model M 2 , on the other hand, declares the action of inserting the potato 
unqualified in the first place. The crucial point, now, is that by assuming this 
disqualification one avoids to grant a disqualification of starting the car. Con- 
sequently, Abi % A& 2 , hence Mi 7 ^ M 2 (nor, of course, M 2 A Mi). Since 
there is obviously no model of {0,V) of the form (£■, i?es, {}),^ both Mi 
and M 2 are preferred. Thus the observation 

® The reason being that either of [insert(pt)] or [insert(pt), ignite] mnst be 
abnormally disqualified according to the underlying transition model in con- 
junction with the disqualifying conditions. 
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Resi : 



-iruns, 

^in(pt), 

-iheavy(pt) 




runs, 

^in(pt), 

-iheavy(pt) 



-runs, 

in(pt), 

-iheavy(pt) 



1 


r -runs, 1 


[ ignite J 


r runs, '] 


Res2 : S 


1 ^in(pt), 

1 heavy(pt) J 


1 heavy(pt) J 


f 1 



Figure 3.2. The scenario O = {-iruns A -iin(pt) after []} suggests two models. 
Our model preference criterion is not sophisticated enough to distinguish between 
them, although the one determined by Resi is much more plausible. 



ignite inexecutable after [insert(pt)] 
for instance, is not entailed, contrary to our expectations. ■ 

It should be stressed that the reason for the second model being counter- 
intuitive is not that in general potatoes being found in tail pipes is more likely 
than these being too heavy to lift. Rather we have encountered the general 
problem that global minimization of abnormalities does not allow to distin- 
guish disqualifications which can be explained from the perspective of causal- 
ity. Successfully introducing a potato into the tail pipe produces an effect 
which causes the fact that the second action, starting the engine, is unquali- 
fied. That is to say, while an abnormal disqualification of insert(pt) comes 
out of the blue in the unintended preferred model, M 2 , an abnormal disqual- 
ification of ignite, as claimed in model Mi, is easily explicable. One even 
tends to not call the latter abnormal since being unable to start the engine 
after having clogged the tail pipe is, after all, what one can reasonably expect. 

Thus we see that the dynamics of state transition possibly gives rise to 
a special kind of preference among abnormal disqualifications — a preference 
which is not entailed by observations nor by a priori differences in likelihood. 
Our account of the Qualification Problem therefore needs revision. 



3.3 Causing Abnormal Action Disqualifications 

When assuming away abnormal disqualifications of actions in order to tackle 
the Qualification Problem, care has to be taken that any such assumption 
is withheld in case specific information hints at the presence of abnormal 
circumstances. We have seen that hints of this kind may be revealed by the 
dynamics of state transition, a fact which is ignored if the preference criterion 
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is given by global minimization. Abnormalities caused by the performance of 
actions need to be preferably accepted or, for that matter, not considered 
abnormal at all. The action of starting the car engine being disqualified, for 
instance, is a perfectly reasonable, since natural, consequence of introducing 
a potato into the tail pipe. 

With the last remark we seem to have pinned down the problem to the 
question of how to account for the fact that abnormalities may be obtained 
as a side effect of other actions. That is to say, the occurrence of an action 
disqualification is not abnormal if being evoked as indirect effect of another 
action. Put that way, the whole apparatus of Chapter 2 might furnish a 
ready fundamental for a solution to the problem of caused abnormalities: 
Suppose the expression disq{a) is a fluent stating whether or not action a 
is abnormally disqualified in a state. In the light of this new interpretation, a 
disqualifying condition F D disg(a) reveals itself as state constraint so that 
disq{a) holds whenever F is true. In particular, any such constraint should 
give rise to the indirect effect disq{a) whenever F is brought about. Fluent 
in(pt) being made true by some action, for instance, is thus expected to 
trigger the indirect effect disg(ignite) according to in(x) D dzsg(ignite), 
now to be taken as state constraint. 

Fluents representing disqualifications in conjunction with suitable state 
constraints enable us to model, by means of ramification, situations where an 
abnormality is caused by other actions. Whenever some disq{a) is initiated 
in this way, then that should no longer receive special attention insofar as 
comparing the abnormalities occurring in models is concerned. This reflects 
the intention to consider normal any case of abnormal action disqualifications 
which are deliberately brought about. The question remains, however, how 
the ‘real’ abnormalities are assumed away by default. If, as we have argued, 
this cannot be done globally over all states, then what is the alternative? 

The answer is that abnormalities are to be minimized solely in the initial 
state and so to leave it to both persistence and ramification to take care of the 
correct evolution of these fluents as time advances. We recall Example 3.2.2 
to supply the reader with a feeling of how that strategy solves the problem 
of causally motivated preferences among abnormalities. Let the two fluent 
formulas 

3x. in(a:) D ignite) 

Va;[heavy(x) D dzsg(insert(a;)) ] 

be state constraints. Given the observation ^runs A ^in(pt) after [], it is 
consistent to assume the initial state be 

S = {-runs, -iin(pt), -iheavy(pt), -idisg(ignite), -i(izsg(insert(pt)) } 

Noticeably, all abnormalities are denied in this state. It is clear, therefore, 
that S is preferred according to the proposed criterion. As a matter of fact, 
it is the only state that possesses this property. For if heavy(pt) were true, 
then this would entail an abnormality according to the second state constraint 
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of (3.1). Having both -irfzsg(insert(pt)) G S and ->in(pt) G S (the only 
regular precondition of inserting a potato), it follows that action insert(pt) 
is executable in S. The direct effect being in(pt), we expect the additional, 
indirect effect disq{±gn.±te) deriving from the first one of our state con- 
straints (3.1). The successor state resulting from performing insert(pt) thus 
correctly entails an abnormal disqualification of action ignite. To stress the 
point, we again draw the reader’s attention to the fact that the initial state S, 
although entailing an abnormality at a later timepoint, is in itself free of ab- 
normalities. 

Restricting minimization of abnormalities to the initial state thus provides 
a solution to our key example that proved global minimization inadequate 
as a formal account of the Qualification Problem. The question immediately 
rises, then, whether the improved approach is generally well-founded and not 
refutable itself. To begin with, the new minimization strategy is bound to 
fail in case it neglects preferences apparently to be made among alternative 
ways of minimizing in specific situations. Now, any such preference either 
grounds on static reasons, that is, it holds in any state regardless of both 
past and future states, or the reasons for a preference lie in the dynamics 
of state evolution. As regards static preferences, different degrees of a priori 
likelihood of abnormalities shall be neglected for the moment and dealt with 
in Section 3.5. For the moment the only possible static reason for preferring 
to accept an abnormality is additional information implying its presence. To 
account for this is the role of state constraints defining conditions for a fluent 
disq{a) being true. Since any acceptable state must satisfy these constraints 
by definition, our approach does respect static reasons for preferring to accept 
an abnormality. 

Reasons to accept an abnormality grounded on the dynamics of state tran- 
sition are linked to causality. That is to say, the acceptance of an abnormal 
disqualification may be founded on the performance of certain actions. Let 
us take for granted the unidirectionality of causality forward in time.^ Two 
points need to be made then. First, suppose that at some stage an action is 
performed which brings about a condition for an action a being abnormally 
disqualified. Whenever this happens, the corresponding fluent disq{a) should 
become true as side effect of the other action. This is achieved by any state 
constraint of the form F D disq{a) giving rise to the indirect effect disq{a) 
as soon as F gets invoked. Our solution to the Ramification Problem guar- 
antees this — provided that the underlying influence information entails that 
any fluent occurring in F potentially affects disq{a).^ 

^ This is not meant as offense to the minority of philosophers who take serions 
the idea of effects preceding their causes, nor to the minority of physicists 
who interpret quantum mechanics in such a way that it gives rise to backward 
causality. We appeal to common sense here. 

® This holds even if one abandons the rather idealistic assumption that a domain 
specification contain complete knowledge as to the possible reasons for an action 
disqualification. The trick is to introduce an artificial fluent representing the 
‘unknown’ cause; see Section 3.5 below. 
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The second point to be made, which completes the overall argument, is 
to justify our minimizing abnormalities initially. But given that causality 
is effective only forward in time, it is clear that no causal reason for an 
abnormality in the initial state can possibly be known of. This by no means 
implies that such a causal reason does not exist. But if it does, then it must 
lie outside the scenario specification, hence has no influence on the correct 
reasoning about this scenario. 

Having justified our strategy of how to cope with the Qualification Prob- 
lem, we proceed with developing a formal account of this approach. To begin 
with, any so-called qualification domain is supposed to include, for each ac- 
tion a, the fluent disq{a) stating whether or not action a is abnormally 
disqualified in a state. Abnormal disqualifications indicate abnormal circum- 
stances. The latter might be specified with the help of fluents which, too, 
are expected to be assumed false by default. Example fluents of this kind 
might be in(pt) and heavy(pt), as normally tail pipes are not clogged by 
potatoes, let alone the possibility of a potato being too heavy to lift. Fluents 
describing abnormal circumstances can be combined in state constraints to 
describe the conditions for a particular action being abnormally disqualified. 
We make no formal presuppositions as to the structure of these constraints, 
but we will later, in the following section, argue for a general strategy of how 
to design them. 

Definition 3.3.1. A plain® qualification domain D is a ramification do- 
main with a distinguished subset Tab, called abnormality fluents, of the set 
of all fluents so that disq{a) G Tab for each action a. The transition model 
of T> is a mapping E from pairs of an acceptable state and an action into 
(possibly empty) sets of states such that S' € E(S,a) iff ~<disq{a) € S and 
S' is successor of S and a. m 

Notice that the transition model of a qualification domain assigns no next 
state whenever an action is abnormally disqualified, regardless of whether 
successor states exist. 

Like in the approach based on global minimization, the new minimiza- 
tion policy is realized by means of a model preference criterion. Informally 
speaking, the less abnormality fluents are initially true the better. 

Definition 3.3.2. A plain qualification scenario is a pair {O^V) where V 
is a plain qualification domain and O is a set of observations. An interpreta- 
tion/or (0,1)) is a pair (E,Res) where E is the transition model of T> and 
Res is a partial function which maps finite (possibly empty) action sequences 
to acceptable states and which satisfies the following: 

1. i?es([]) is defined. 

2. For any sequence a* = [oi, . . . , Ofc_i, oj,] of actions (k > 0), 



® The reason for calling “plain” these qualification domains is that for the mo- 
ment all abnormalities are considered equally likely. 
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a) Res{a*) is defined if and only if i?es([oi, . . . , Ofc_i]) is defined and 
S{Res{[ai , . . . , afe_i]), a^) is not empty, and 
h) Res{a*) G r(_Res([ai, . . . , afe_i]), a^) . 

If I = {S, Res) and /' = {S, Res') are interpretations, then I is less 
abnormal than I', written I -< I', iff iZes([]) C\ iFab £ ^es^([]) ^ 

model of a plain qualification scenario {0,T>) is an interpretation (S,Res) 
such that each o € O is true in Res . A model is preferred iff there is no 
other model which is less abnormal. An observation o is entailed iff it is true 
in all preferred models of {0,T>). ■ 

Example 3.3.1. Let T> be the ramification domain of Example 3.2.2 but now 
including the fluent names dzs(/(ignite)° and dzsg(insert)^ plus the steady 
state constraints 



3x. in(x) D dzsg(ignite) 

Vcc [heavy(a:) D dzsg(insert(a;)) ] 

Consider Tab = {in(pt), heavy(pt), dzsg(ignite), dzsg(insert(pt))}, then 
2? is a (plain) qualification domain. Let O consist of the observation 

-runs after [ ] 

then {0,T>) is a qualification scenario. Suppose E be the transition model 
of T>. The domain being deterministic, interpretations (27, Res) are uniquely 
characterized by the initial state i?es([]), e.g. 

Resi{[]) = 

{^runs, -iin(pt), -iheavy(pt), -idzsg(ignite), -i(izsg(insert(pt))} 

Obviously, (E,Resi) is a model of {0,T>) as it satisfies -runs G i?esi([]). 
Since i?esi([]) ATab = {}> this model is preferred. Moreover, it is unique in 
that respect because the initial value of the only existing non-abnormality 
fluent, viz. runs, is fixed by the given observation. Thus whatever is true 
in this model is also entailed by our scenario, {0,T>). In particular, we have 
in(pt) G i?esi([insert(pt)]) and, hence, dzsg( ignite) G i?esi([insert(pt)]) 
according to transition model 27. Consequently, {0,T>) entails both the two 
observations 



in(pt) after [insert(pt)] 
ignite inexecutable after [insert(pt)] 
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Up to this point our discussion of the Qualification Problem revolved around 
the questions both of how to assume away abnormal disqualifications and 
of how to withdraw this assumption in case some condition for the corre- 
sponding abnormality is entailed. A related task is of equal importance for 
reasoning about actions in environments which are unpredictable insofar as 
success of actions cannot be guaranteed. Suppose an action turns out unex- 
ecutable. This should come as a surprise whenever all regular preconditions 
were satisfied and there was no reason to believe in an abnormality. Having 
nonetheless encountered the disqualification, it is both natural and reasonable 
to seek an explanation. An autonomous agent, for instance, whose current 
goal relies on the successful performance of the action that failed should try 
to figure out what went wrong in order to rectify it, if possible. This raises 
the question of what qualifies as an adequate explanation for an abnormality. 

The natural thing to do if an abnormal disqualification has been observed 
is to search the available domain knowledge for conditions which would entail 
it. Among all potential causes those offer an explanation which are compatible 
with the entire state of affairs. Suppose, for instance, our protagonist knows 
that starting the car is impossible if some object clogs the tail pipe, if the 
tank is empty, the battery is low, or if there is a general problem with the 
engine itself. Suppose further that she has checked both tail pipe and tank 
and has also confirmed the good status of the battery, say, by turning on the 
radio. To the best of her knowledge, then, failing to start the car must be 
caused by an engine problem. 

On the formal side, this generation of explanations for observed abnor- 
malities requires a specific policy of designing the state constraints that relate 
action disqualifications to their conditions. To see why, suppose the following 
four constraints specify all that is known as to reasons for failing to start a 
car. 

3a;. in(a;) D disq(ignite) 
tank-empty D disq(ignite) 

yo.Z j 

low-battery D disq{ign±te) 



engine-problem D dzsg(ignite) 



The conjunction of these formulas is logically equivalent to 



3x. in(a;) V tank-empty V low-battery V engine-problem 
D ignite) 



(3.3) 



This being a fluent formula that any acceptable state must satisfy, whenever 
at least one of the four (abnormal) causes for a disqualification of ignite 
holds at some point, then disq{±ga±te) holds as well. The converse, how- 
ever, is not necessarily true. That is to say, dzsg(ignite) may hold in a 
state without any of the potential causes on the left hand side of the above 
implication being true. Even worse, the four conditions supposedly belong 
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to the set of abnormality fluents since each describes rather unusual circum- 
stances. Consequently, not only do models exist which entail no explanation 
for dzsg(ignite) in case the latter is known to be true, these models are 
even preferred. 

The desired proliferation of explanations can, however, be achieved by a 
modification known as completion of the fluent formula (3.3). The problem 
with this implication, as it stands, being that it does not support any further 
conclusions from disg(ignite), it is transformed into a bi-conditional: 

3x. in(x) V tank-empty V low-battery V engine-problem , , 

= dzsg(ignite) ' ' 

\jet this formula replace the state constraints listed in (3.2). Then any ac- 
ceptable state which satisfies disq (ignite) must also satisfy at least one 
of 3x. in(a;), tank-empty, low-battery, or engine-problem. The three 
observations 



-•runs after [] 
ignite inexecutable after [] 

Vx. -'in(a;) A -itank-empty A -ilow-battery after [] 

for instance, entail engine-problem after [] on the basis of the completed 
state constraint. 

The mechanism of completion provides us with a general strategy of 
how to design state constraints so as to automatically supply explanations 
for observed action disqualifications. Suppose n different abnormal condi- 
tions, specified by fluent formulas Ti, . . . ,f„, are known to render impos- 
sible the execution of action a. Instead of introducing the n implicational 
state constraints Fi D disq(a), . . . , Fn D disq(a), we employ a (stronger) 
bi-conditional as follows. 

n 

\J Fi = disq(a) (3.5) 

i=l 

It obviously entails the aforementioned n implications, but in addition allows 
to reason the other direction, i.e., from abnormalities to their possible causes. 
While this is an invaluable gain of using completion, there is, however, an im- 
portant objection against state constraints of the form (3.5). Namely, equat- 
ing a disqualification with a disjunction of conditions presupposes complete 
knowledge as to the possible reasons for an action to fail. No doubt, abnormal 
action disqualifications being rare exceptions by definition, the situation is 
even more exceptional where this disqualification cannot be explained. Yet, 
not accounting for these cases seems to ignore the fact that generally only 
partial knowledge can be acquired of real-world domains. Completion being 
too valuable to abandon, however, we favor the development of additional 
means to deal with inexplicable abnormalities. 

Prior to so doing, let us mention another crucial issue when dealing with 
action disqualifications, which stresses the importance of completed state 
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constraints. Suppose our protagonist clears the tail pipe again after having 
successfully introduced a potato. If initially starting the engine has been 
assumed qualified by default, then this assumption had to be withdrawn after 
having clogged the tail pipe. But the assumption of qualification is to be re- 
installed in the following situation where the potato has been removed again. 
For if prior to performing the two actions nothing hints at an abnormality, 
then there is no reason to believe that things have been changed by these 
two actions with reverse effect. This revoking qualification is granted if state 
constraints of the form (3.5) are employed. To see why, suppose that initially 
each Fi (1 < f < n) and, hence, disq{a) is assumed false by default. Suppose 
further that later on one particular Fj^ € {F^i, • • ■ ,F'„} is initiated and that 
sometime thereafter F), is terminated again. Then -<disq{a) must hold at 
the end. This follows from persistence of each ^F j (1 < j < n, j ^ k) and 
from the fact that AT=i ^ implies ~<disq{a) according to the completed 
state constraint. 

Let us summarize and illustrate the discussion on conditions for abnormal 
action disqualifications by a fully formalized example which highlights the 
essential aspects. 

Example 3.4-. 1. Let T> be the plain qualification domain consisting of the 
entity pt, the fluent names T = {runs°, in^, tank-empty*^, low-battery*^, 
engine-problem**, heavy*^, dzsg(ignite)**, dzsg(insert)*, dzsg(clear)**} such 
that only runs is a non- abnormality fluent, the action names A = {ignite**, 
insert*, clear**}, the action laws 



ignite 


transforms 


{-•runs} 


into 


{runs} 


insert(a;) 


transforms 


|^in(a;)} 


into 


{in(a;)} 


clear 


transforms 


{in(a;)} 


into 


{-in(a:)} 


the three steady state constraints 









[3x. in(x) V tank-empty V low-battery V engine-problem 

= dzsg(ignite) ] 

heavy(x) = dzsg(insert(a;)) 

T = dzsg(clear) 

and, obtainable through the influence information 

X = |(heavy(a:), dzsg(insert(a;)))} 

U{(/, disg(ignite)) : 

/ G |in(a;), tank-empty, low-battery, engine-problem}} 
these ten (steady) causal relationships: 
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in(a;) causes disq{±gn.±te) if T 

tank-empty causes disq{±gn.±te) if T 

low-battery causes disq{±gn.±te) if T 

engine-problem causes disq{±gn.±te) if T 

^in(x) causes ~<disq{±ga±te) 

if Vj/. -i±n(y) A -itank-empty 

A -ilow-battery A ^engine-problem 
-■tank-empty causes -'disq{±gD.±te) 

if Vj/. -'±n(y) A -ilow-battery 
A -lengine-problem 
-■low-battery causes -■dzsg(ignite) 

if Vj/. -<±n(y) A -■tank-empty 
A -■engine-problem 

-■engine-problem causes -■rfzs(/(ignite) 

if Vj/. ^in(y) A -■tank-empty A -■low-battery 
heavy(a;) causes disq{±nseTt{x)) if T 

^heavy(a;) causes -■dzsg(insert(a;)) if T 

1. Let Oi consist of the observations 

-■runs after [ ] 

ignite inexecutable after [] 

Vx. -■in(x) A -■tank-empty A -■low-battery after [] 

Then the qualification scenario {Oi,T>) entails engine-problem after [ ] . 

2. Let O 2 consist of the only observation 

tank-empty after [] 

Then the scenario {02,T>) entails ignite inexecutable after []. Notice 
that despite this disqualification any preferred model {S, Res) of the 
scenario satisfies -■in(pt), -■low-battery, -■engine-problem € _Res([]), 
for all these fluents belong to the set of abnormality fluents. Consequently, 
{ 02 ,V) also entails 

Vx. -■in(x) A -■low-battery, -■engine-problem after [] 

That is to say, although action ignite is known to be abnormally dis- 
qualified, we still conclude, by default, that the tail pipe is clear and that 
both battery and engine are in order. 

3. Finally, let O 3 consist of the observation 



■runs after [ ] 
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Then the scenario entails in(pt) after [insert(pt)], hence 

also ignite inexecutable after [insert(pt)] . Moreover, the observa- 
tion -iin(pt) after [insert(pt), clear] and, hence, the observation 
-'disq{±ga±te) after [insert(pt), clear] are entailed. Mainly responsible 
for this last conclusion is the causal relationship 

->in(a;) causes -'(izsg(ignite) if Vy. -'in(j/) A -■tank-empty 

A -ilow-battery 
A -lengine-problem 

Thus, in addition to the above, { 03 ,T>) entails 

runs after [insert(pt), clear, ignite] 

which shows how qualification gets revoked once the only reason for a 
disqualification disappears. 



3.5 Degrees of Abnormality 

Our account of the Qualification Problem so far makes two simplifying as- 
sumptions. First, all abnormalities are considered equally unlikely. Second, 
complete knowledge as to all possible causes for an abnormal disqualifica- 
tion is implicitly assumed by equating the abnormality with these causes. 
A natural extension of the current theory is to allow different degrees of 
abnormality. In this way, it becomes possible to specify, for instance, that 
having been running out of gas is more likely than a low battery, which in 
turn is more plausible an explanation for being unable to start the car than 
a potato being placed in the tail pipe. Different levels of likelihood are thus 
representable without the necessity to provide precise probabilities. 

The introduction of degrees of abnormality incidentally also provides a 
ready means to resolves the problem associated with our equating abnormal 
disqualifications with their possible causes. In addition to all conceivable 
reasons for an abnormality to occur, one fluent in the list of causes may 
represent the unknown cause. Suppose a domain constraint disq{a) = Vi Fi 
is weakened to the effect that disq{a) = \JiFiW mysterious(a) . Then it 
is no longer violated in case a has been observed abnormally disqualified 
while each known cause, Fi, has been proved false. ^ Suppose further that 
‘abnormality’ fluent mysterious(a) has higher degree of improbability than 



^ Introducing the unknown cause allows to model, for example, the tendency of 
some human beings to hit out at mechanical devices, e.g., dispensing machines, 
when discovering their malfunctioning. The foregoing reasoning process there, if 
any, attributes the encountered malfunction to some unobservable fluent which 
is hoped to being manipulable by means of that very action. 
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any other, concrete cause. Then the unknown cause is used only as a last 
resort when it comes to explaining an abnormality. 

On the formal side, degrees of abnormality are represented by a par- 
tial ordering among the ‘abnormality’ fluents. If, for instance, we spec- 
ify that tank-empty < low-battery, then low-battery shall always 
be minimized — initially — with higher priority than tank-empty. Accord- 
ingly, the concept of unknown causes is adequately modeled by defining 
fab < mysterious(a) for all other ‘abnormality’ fluents fat- Property 
mysterious(a) is thus assigned the highest degree of abnormality. Being 
a partial ordering, the comparison relation < may be indifferent regarding 
some pairs of fluents, in which case no preference is made for either of them. 
For a formal introduction to partial orderings and related concepts see An- 
notation 3.2. In the following, we extend our solution to the Qualification 
Problem to the effect that different degrees of abnormality are supported. 



A binary relation R on some set A is a subset of the Cartesian product Ax A, 
that is, A is a set of pairs (a, b) where a,b G A . If (a, b) G R, then this is also 
written aRb . A binary relation R may obey the following properties: 

Va G A. -■ aRa (irreflexive) 

Va, 6 G A. {aRb A bRa Zb a — b) (antisymmetric) 

ya,b,c G A. {aRb A bRc D aRc) (transitive) 

If it does, then the relation A is a partial ordering on A. If in addition A 
satishes Va, 6 G A. {aRb V bRa), then A is strict. A strict ordering R' is an 
extension of a partial ordering A iff A' 3 A, that is, whenever aRb then also 
aR'b. Let, for example, A be 

{tank-empty, low-battery, engine-problem, in(pt), mysterious(ignite)} 

and suppose a partial ordering < be given by 

tank-empty < low-battery < in(pt) < mysterious(ignite) 
tank-empty < engine-problem < mysterious(ignite) 

Let <C be < augmented by low-battery ^in(pt) ^engine-problem , then <C 
is one out of three possible strict orderings extending <. 



Annotation 3.2. Orderings. 



Definition 3.5.1. A qualification domain T> is a plain qualification domain 
augmented by a partial ordering < on the set of abnormality fluents. Accord- 
ingly, a qvLaliG.ca,tion scenario {0,T>) consists of a set O of observations and 
a qualification domain T>. 

Suppose M = {S,Res) is a model of {0,T>). Then M is preferred iff 
there is a strict ordering <C which extends < and such that for all mod- 
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els M' = {E,Res') of (0,V) and all fab G Eab-’ If fab G i?es([]) \ i?es'([]), 
then there is some fab' fab such that fab' G i?es^([]) \ i?es([]). ■ 

In words, a preferred model is obtained by first choosing a minimization 
strategy, that is, a strict ordering which respects the given partial one. Then 
the model is preferred whose evolution function Res satisfies the following: 
Suppose some abnormality fluent fab is initially true in Res but false in the 
evolution function Res of some other model. Then there must be another 
abnormality fluent fab of higher priority than fab according to the chosen 
strict ordering and which is initially false in Res but true in Res . 

Example 3.5.1. Let T> be the qualification domain of Example 3.4.1 but with 
abnormality fluent mysterious(ignite) added as an additional cause for 
dzs(/(ignite). Furthermore, let the following information regarding degrees 
of abnormality be given: 

tank-empty < low-battery < in(pt) < mysterious(ignite) 
tank-empty < engine-problem < mysterious(ignite) 

Suppose E be the transition model of V, and let O consist of the observa- 
tions 

-iruns after [] 
ignite inexecutable after [] 

-•tank-empty after [] 

Given that tauik-empty is false initially in any model of {0,T>), each 
of low-battery, engine-problem, in(pt), mysterious(ignite) offers as ex- 
planation for the observed disqualification. Following the a priori knowl- 
edge of likelihood given by <, we obtain two preferred models, namely. 
Ml = {E,Res\), where Resi{[\) C\ Tab = {low-battery, dzsg(ignite)}, and 
M 2 = {E, Res 2 ), where i?es 2 ([]) I^Tab = (engine-problem, dzsg(ignite)}.® 
Consequently, low-battery V engine-problem after [] is entailed by the 
qualification domain (0,21), as is -■in(pt) A -•mysterious(ignite) after [ ] . 
Highly unlikely causes such as a potato being in the tail pipe, let alone a 
mysterious disqualification, are thus assumed away. ■ 

This completes our formal account of the Qualification Problem. Let us 
summarize: A qualification domain is supposed to contain a distinguished set 
of fluents Tab, each of which describes abnormal circumstances and thus is 
to be assumed false by default. This assumption, however, needs to be re- 
stricted to the initial state, so that these fluents are subject to the general 
law of persistence but are also potentially (directly or indirectly) affected by 
the performance of actions. Among these so-called abnormality fluents are 
expressions, denoted disq{a), which represent the property of action a be- 
ing abnormally disqualified. State constraints relating disq{a) with possible 

® Tacitly assuming that both Res\ and Res 2 are otherwise arbitrary but satisfy 
the conditions of Definition 3.3.2. 
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causes support the proliferation of explanations in case an abnormal disqual- 
ification surprisingly occurs. Additional information may be provided as to 
different degrees of abnormality. The latter also furnishes a ready approach 
to accommodate situations in which a suitable explanation is not possible. 
The default assumption of ‘normality’ is formally captured by a model prefer- 
ence criterion, which induces a nonmonotonic entailment relation — a feature 
inherently connected with the Qualification Problem. 



3.6 A Fluent Calculus Axiomatization 

The adaptation of our Fluent Calculus-based axiomatization at the end of 
Chapter 2 to qualification domains and scenarios requires two issues be ad- 
dressed. First, the notion of action sequences being qualified needs refinement. 
The existence of a successor state is still necessary but no longer sufficient 
a criterion. What needs additionally be guaranteed is that no abnormal dis- 
qualification is present. Modifying the axiomatization to this extent is rather 
straightforward. Second, and more substantial, some qualitatively new mech- 
anism needs to be introduced to account for nonmonotonicity as an intrinsic 
feature of the Qualification Problem. Recall that our axiomatization of ram- 
ification domains and scenarios uses pure classical logic, hence is monotonic. 
The nonmonotonic framework in which it is to be embedded serves the pur- 
pose of assuming away abnormality by default. 



3.6.1 Axiomatizing Qualification 



The concept of action sequences being qualified has been defined, in the 
axiomatization of ramification in Chapter 2, by the following two axioms. 



Qualified{[a*\a\) 



Qualified {[]) 
Qualified{a*) A 3s, s' 



Result {a*, s) A 
Successor{s, a, s') 



(3.6) 



(The reader may recall that Qualified{a*) indicates that action sequence a* 
is qualified, i.e., executable; Result{a*,s) determines state s as the result 
of performing a*; and Successor's, a, s') is true iff s' is a successor state 
of s and action a.) These axioms reflect the (idealistic) view that an action 
can be successfully performed whenever the regular preconditions are met. 
When addressing the Qualification Problem, an additional condition needs 
to be accounted for: The action in question must not be abnormally disqual- 
ified. This can be axiomatized with the help of the fluents disq{a). The new 
definition of qualification, replacing (3.6), is as follows: 



Qualified{[]) 



(3.7) 
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Qualified {[a* \a\) 



Qualified{a*) A 3s, s' 



Result {a* , s) A 
Holds{-<disq{a), s) A 
Successor{s, a, s') 



(3.8) 



The second axiom defines, in words, an action sequence [a*|a] to be qualified 
if so is a * , if the result s of performing a* does not entail an abnormal dis- 
qualification as regards a, and if there exists a successor s' of s and a. The 
reader may liken this to the notion of the model component Res being de- 
fined for the argument [a*|a] (c.f. Definitions 3.3.1 and 3.3.2 in Section 3.3). 
No further foundational axioms introduced in Chapter 2 require replacement. 



Example 3.6.1. Let T> be the qualification domain consisting of the en- 
tity pt; fluent names runs®, in^, mysterious(ignite)®, dzs(/(ignite)®, 
so that only runs is an abnormality fluent; the partial priority ordering 
given by in(pt) < mysterious(ignite); action name ignite®; action law 
ignite transforms {-•runs} into {runs}; and the steady state constraint 



dzsg(ignite) = 3x. in(cc) V mysterious(ignite) 



Let TCx! denote the Fluent Calculus-based axiomatization of this domain 
as described in Chapter 2 (c.f. Section 2.9.2). Consider, then, the collection 
of classical formulas TCv U {(3. 7), (3. 8 ), (2.37)-(2.40)|, and suppose that 
-'Qualified{[l^'n.'ite\) is true. The latter, in conjunction with axioms (3.7) 
and (3.8), implies that for all s, s' we have 

~'Result{[\, s) V -<Holds{-<disq{±ga±te), s) V ->Suecessor{s, ignite, s') (3.9) 

The domain-independent axioms entail 3s [Result{[], s) A State(s)]. Correct- 
ness of 1FCt> moreover ensures that State{s) D Vs'. -'S'Mccessor(s, ignite, s') 
is entailed iff so is Holds {rmis, s), for ->runs is the one and only regular pre- 
condition of action ignite. Put together, the disjunction (3.9) implies the 
following. 

Vs [i?esMZt([], s) D Holds{disq{±gn.±te), s) V Holds{rwa.s, s)] 

That is to say, ignite being unexecutable, i.e., ~<Qualified{[±ga±te]) , im- 
plies either an abnormal disqualification or that the engine is already running. 



On the analogy of a result in Chapter 2, a one-to-one correspondence 
holds between the classical models of IFCt) U |(3.7),(3.8),(2.37)-(2.40)} and 
the interpretations for qualification scenarios of domain T>. More precisely, 
let L be some model of the aforementioned collection of classical formulas 
and I = {E, Res) an interpretation for a scenario {0,T>). Then we say that 
t and I correspond iff for all action sequences a*, states S, and collections 
of fluent literals s such that EUNA |= s = T 5 , we find that 

Res{a*) = S iff [Result {a* , .s)]' is true 
On this basis we can prove the following: 
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Theorem 3.6.1. Let {0,T>) he a qualification scenario with transition model 
E and Fluent Calculus axiomatization TCx>- Then for each model l of 
TCxi U {(3.7),(3.8),(2.37)-(2.40)} there exists a corresponding interpreta- 
tion {Res,E) for {0,T>) and vice versa. 

Proof. We confine ourselves to the differences to the proof of Theorem 2.9.2, 
which arise from the refined notion of qualified action sequences. The modi- 
fied axiom (3.8) additionally requires that the fluent disq{a) be false in the 
state resulting from performing a* . This is exactly what distinguishes Defi- 
nition 3.3.1 from Definition 2.7.2, regarding the notion of transition models. 

Qed. 

From now onwards, let — given a qualification scenario {0,T>) — W(p^xi) 
denote the classical formulas tFCj) U {(3.7),(3.8),(2.37)-(2.40)} plus the ax- 
iomatization of the observations in O.® Observations F after [oi, . . . , a„] are 
formalized as before, that is, 

3s [ Result{[ai , . . . , a„], s) A Flolds{F, s) ] (3.10) 

while a disqualification observation a inexecutable after [ai , . . . , a„] is rep- 
resented by 

Qualified{[ai, . . . ,a„]) A ~<Qualified{[ai, . . . , an, a]) (3-11) 

As one would expect, there is a one-to-one correspondence between the set 
of classical models of Wppj and the set of models of the scenario (0,T>). 

Corollary 3.6.1. Let (0,T>) be a qualification scenario with axiomatiza- 
tion Wp x>)> then for each model l of W (^0 xi) there exists a corresponding 
model {Res,E) of {0,T>) and vice versa. 

Proof. Following Theorem 3.6.1 it suffices to show that an observation is true 
in Res iff t is model of the observation’s axiomatization. 

1. By definition, an observation F after [ai,...,a„] is true in Res iff 
Res{[ai, . . . ,an]) is defined and formula F holds in that state. This 
in turn is equivalent to t. being a model of formula (3.10) according to 
axiom (2.38), which stipulates that 3 s. Re suit {a* , s) iff Qualified (a*); 
Proposition 2.9.3, which asserts that Holds is suitably defined; and the 
fact that L and {E, Res) correspond. 

2. By definition, an observation a inexecutable after [ai,...,a„] is true 
in Res iff Res{[ai, . . . , an]) is defined but Res{[ai, . . . ,an,a]) is not. 
This in turn is equivalent to l being a model of formula (3.11) according 
to the fact that l and {E, Res) correspond. 

Qed. 



® The reason for the at first glance unmotivated use of the letter W reveals in 
the following section. 
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3.6.2 Introducing Nonmonotonicity 

The entailment relation of an action theory being a nonmonotonic one has 
been shown a fundamental characteristics of the Qualification Problem. Any 
axiomatization suitable for qualification domains and scenarios must there- 
fore be based on some nonmonotonic extension to classical logic. Speaking less 
abstractly, this going beyond classical logic is necessary in order that abnor- 
malities can be minimized, i.e., assumed away by default. We will meet this 
requirement by embedding our current Fluent Calculus-based axiomatization 
into a so-called default theory. Hence the general nonmonotonic framework 
to be employed is Default Logic, or rather, to be more precise, a conceptual 
extension of the original approach called Prioritized Default Logic. The latter 
is vital for reflecting possible degrees of abnormality when minimizing. For 
a formal introduction to both Default Logic and its prioritized variant see 
Annotation 3.3. 

Given a qualification scenario {0,T>), on the basis of the Fluent Calculus- 
axiomatization FF(c)_x)) we construct a prioritized default theory as follows. 
The classical logic formulas W(^o,Ti) constitute the background knowledge 
(which, by the way, resolves the mystery of why this denotation has been 
chosen). The various assumptions of ‘normality’ are formalized as default 
rules. For each abnormality fluent fab G iPab , let S denote the default rule 

: Vs D ^Holds{fgb, s)] lol 

Vs s) D ~'Holds{fab, s)] 

which is used to express the default assumptions that fab be false initially. 
That is to say, as long as it is consistent to assume that fab does not hold in 
the initial state, we do make this assumption. Let Djy denote the set of all de- 
fault rules thus obtained from domain D. Furthermore, let a partial ordering 
<X) be defined on Dj) so that Sf^<xi5f^' whenever fab < fab according 
to the partial ordering defined on the set Tab of abnormality fluents. 

This completes the axiomatization of qualification scenarios {0,T>) as 
prioritized default theories D-p, <x>). Before we enter the 

proof of correctness, let us check on it by an example. 

Example 3.6.2. Let V be the qualification domain of Example 3.2.1. Then 
Dx> consists of the default rules f^lnCpot)? ^mysterious (ignite) , and <^£(js,^(ignite) ■ 



: Vs [ Result{[], s) D -'i7oWs(in(pt), s) ] 

Vs [Result{[], s) D -'HoWs(in(pt), s) ] 

: Vs [i?esMZt([], s) D -'HoWs(mysterious(ignite), s) ] ^3 

Vs [i?esMZt([], s) D -'i7oZ(is(mysterious(ignite), s) ] 

: Vs [i?esM^t([], s) D -^Holds{disq{±^±te),s)] 

Vs [i?esM^t([], s) D ~'Holds{disq{±^±te),s)] 

where (5in(pot) <d <5mysterious(ignite) due to in(pt) < mysterious(ignite) . 
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The purpose of Default Logic is to extend classical logic, which is taken to en- 
code precise knowledge, by expressions that formalize somehow vague, defeasible 
knowledge. Called default rules (or defaults, for short), these expressions allow 
for stating that some property a ‘normally’ implies some property u>. The ref- 
erence to normality is made precise by specifying circumstances -■/? which must 
provably hold in order that the conclusion from a to uj shall not be valid. The 
formal syntax of a default is where a (the prerequisite), f) (the justi- 

fication), and w (the consequence) all are formulas in classical logic. For our 
purpose, it suffices to consider only so-called normal defaults, where justihcation 
and consequence coincide. In case one or more components of a default contain 
free variables, the then so-called open default is considered representative for all 
of its ground instances. A default theory A = (W, D) consists of a set of de- 
faults D and a set of closed formulas W , the latter of which is called world- (or 
background) knowledge. Reasoning in default theories is based on the formation of 
so-called extensions. These are obtained by starting with the background knowl- 
edge, W , and successively applying defaults chosen from D , which amounts to 
adding their consequences provided their justihcation is consistent with what is 
Hnally obtained as extension. Once there are no more applicable defaults left, the 
deductive closure of the resulting set of formulas constitutes an extension. An 
extension may be regarded as one coherent view on the state of affairs. Formally, 
let be a set of closed formulas. We dehne 

1. To = IT 

2. Fi = Th{Fi-i) U {w : ^ € D, a € Fi-i, -•w 0 E}, for i > 0. 

(where ThifF) denotes the (classical) deductive closure of d>). Then E is an 
extension of A iff = . A default theory may admit multiple extensions, 

each of which is obtained by applying different subsets of the underlying defaults. 
Then a closed formula is said to be skeptically entailed in a default theory iff it 
is contained in all extensions of the latter. 

When constructing extensions of a default theory in the way just described, all de- 
faults are applied with the same priority. For applications where this is undesired, 
an extension of classical Default Logic called Prioritized Default Logic supports 
the specihcation of (possibly partial) preference orderings among defaults. This 
ordering is exploited to select among the extensions of a default theory those 
in which the most preferred defaults have been applied. Formally, a prioritized 
default theory is a triple A — (IF, D, <) where IF and D are as in standard 
Default Logic and < is a partial ordering on D. If E is a closed set of formulas, 
then a default is said to be applied in E iS a,LO € E . An extension E of 
the (standard) default theory (IF, D) is a prioritized extension of A iff there 
is a strict ordering <C extending < such that the following holds for all exten- 
sions E' of (IF, D) and all defaults S' G D: It S' is applied in E' \E, then 
there is some 5<^S' which is applied in E\E' .In words, a standard extension E 
is prioritized if we can find a strict ordering respecting < and the following is 
true: Whenever some default 5' is not applied in E but in some other standard 
extension E' , then there is also a default <5 which is applied in E but not in E' 
and which has higher priority than S' according to the strict ordering. 



Annotation 3.3. Classical and Prioritized Default Logic. 
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Now, suppose O consists of the observations 

-runs after [] 
ignite inexecutable after [] 

We have already seen that bF(o,x>) and the axiomatization of the second 
observation, viz. Qualified{[\) A ~'Qualified{[±^±te\) , entail 

Vs s) D i/oWs(dzsg(ignite), s) V i/oWs(runs, s) ] 

The encoding of the first observation, viz. 3s [ Result{[], s) A i/oWs (-runs, s) ], 
in conjunction with the axiom Result{[], s) A Result{[], s') D s = s' allows 
to strengthen the above to 

Vs [i?esM?t([], s) D Holds (disq{±gD.±te), s)] 

It follows that default <5dis?(ignite) cannot be applied. From the state con- 
straint disq{±gD.±te) = 3a;. in(x) V mysterious(ignite) in J^Cx> we also 
conclude that 

Vs [ i?esM?t([], s) D 3x. Holds s) V iVoWs(mysterious(ignite), s) ] 

Since entity pt is the only one, it follows that <5in(pot) and <5mysterious(ignite) 
are mutually exclusive, for either consequence in conjunction with W(^(j x') 
implies the negation of the other rule’s justification. According to the prior- 
ity ordering <x>, default iJinCpot) is to be preferably applied when seeking 
prioritized extensions of Our default theory A(^o x') thus admits a 

unique prioritized extension E, which includes the following formula. 

Vs[i?esrtZt([], s) D s = -runso in(pt) o-imysterious(ignite) o dzsg(ignite)] 

This formula is skeptically entailed by -p) • The reader may notice that ac- 
cordingly the qualification scenario {0,T>) admits a unique preferred model 
(A, Res), which satisfies 

i?es([]) = {-runs, in(pt), -'mysterious(ignite), dzs(/(ignite)} 



In the remainder of this section, we prove general correctness of our ax- 
iomatization of qualification scenarios by means of Prioritized Default Logic. 
Some preparations are required to this end. Let A(^q x>) be the axiomati- 
zation of a qualification scenario {0,T>) with Tab being the abnormality 
fluents. Then a prioritized extension E of A(^q x') &ud an interpretation 
(A, Res) for this scenario are said to eorrespond iff for all fab & J^ab we And 
that 

-•fab & Res{[]) iff 'is[Result{[],s) Z) ~'Holds{fab,s)] G E (3.14) 

The notion of potential extensions to be introduced next will be crucial 
for the proof of our main theorem. In what follows, for notational convenience 
we use the abbreviation Initially {-^i) for Vs [ s) D ~<Holds{£, s)]. 

Suppose F is a set of formulas which consists in the following: 
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1. VP(o,D); 

2. either Initially fab) or -> Initially {-if at), for each abnormality flu- 
ent fab- 

Then E = Th{F) is called a potential extension of A(^q x>)- Notice that po- 
tential extensions may be inconsistent, e.g., if Initially {-< fab) G F although 
^( 0 , 25 ) necessitates that fab be true in any s which satisfies Result{[], s) . 

Given a potential extension E = Th{E), we call induced hy E any strict 
ordering that extends <x> such that 

whenever Initially {~i fab) G F and -> Initially {-if '^ffj G F 

Induced orderings will be used below to verify the constituent properties for 
potential extensions being prioritized extensions. It is easy to verify that the 
standard extensions of the default theory (H^(o,'D)j -Dd) are always potential 
extensions. 

Lemma 3.6.1. Let A(^q x,) = {W(^o , Dx> , <v) be the axiomatization of 
some qualification scenario, then each (standard) extension of {W(o,v),Dj 3 ) 
is a potential extension of A(p x,y 

Proof. Let E be an extension of {D-ti,W(^o,'D)) , and let 

1. Fq = IL(o,d); 

2. El = Th{Eo) U {uj : G Dxi, -k-o ^ E}] and 

3. F2 = Th{Ei). 

Since all possibly applicable defaults in Dj) have been applied to com- 
pute Fi and since E is extension, we know that E 2 = E. By construc- 
tion, / 2 , hence E, is subset of some potential extension. To see, then, 
why it equals a potential extension, observe first that E = Th{E) and 
hL(o^D) G E. It remains to verify that for every fab G Fab, E includes 
either Initially {-i fab) or Initially {-i f ab) ■ Let fab G Fab be an abnormal- 
ity fluent. From G Dx> and the construction of A, we 

know that either Initially {-i fab) G A, hence Initially {-i fab) G E, or else 
-I Initially {-I fab) G E . Qed. 

The notion of correspondence between interpretations for qualification 
scenarios and prioritized extensions generalizes to potential extension in 
the obvious way — a potential extension E = Th{F) and an interpretation 
{S,Res) correspond iff the condition (3.14) holds for all fab G Fab- No- 
tice that each interpretation has a unique corresponding potential extension, 
whereas there may be multiple interpretations corresponding to a single po- 
tential extension. For the latter does not necessarily fix all states resulting 
from the performance of action sequences. Notice further that whenever E is 
consistent then there exists a corresponding interpretation which is a model 
of the qualification scenario at hand. This is granted by Corollary 3.6.1, for 
if E is consistent it admits a (classical) model t — which then corresponds 
to some model of the scenario. 
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We are now prepared to prove general correctness of our entire axiomati- 
zation. 

Theorem 3.6.2. Let {0,T>) he a qualification scenario whose axiomatiza- 
tion is the prioritized default theory Ai^q j)) = (W{ 0 ,v)-, D-v, <v) , then for 
each prioritized extension of A/^q j}) there exists a corresponding preferred 
model of {0,T>) and vice versa. 

Proof. Let iFab be the set of abnormality fluents of T>. 

U V . 

Let M = (E,Res) be some preferred model of {0,T>), and let E = Th{F) 
be the potential extension corresponding to M . To begin with, we prove 
that if is a standard extension of {W(o ,7)) , Dv) ■ Let 

1 . io = W(o.T>)'^ 

2. El = Th{Po) U {oj : -ff- G Dxi, -•uj ^ E}; and 

3. E2 = Th{E,). 

Then we have to verify that E 2 = E (c.f. the proof of Lemma 3.6.1). Clearly, 
E 2 Q E, since for any G Dj) such that w G Ei, we have -'CO ^ E, which 
in turn implies oj G E given that if is a potential extension. Moreover, the 
assumption E 2 E leads to a contradiction: Suppose E 2 £ E, then this 
indicates the existence of some G Dj) (where u = Initially {-•fab) for 
some fab G Eab) such that -•lo G E but -'W ^ E 2 . Let 17 be the set of all 
these UJ, i.e., 17 = {-•uj G E : -•uj ^ E 2 }. Then if' = (if \ 17) U {w : -•uj G 17} 
is an extension of (lL(o,i>); Dv)- From Lemma 3.6.1 we conclude that if' is 
a potential extension of Let M' be an interpretation corresponding 

to if' such that M' is a model of {0,T>). From the construction of if' 
and the definition of correspondence it follows that M' contains strictly 
less abnormality assumptions than M , given that 17 is non-empty. Hence, 
M' -< M, which contradicts M being a preferred model. 

Having proved that E is an extension of {W(^o,t>), Dd) , it remains to be 
shown that it is prioritized. Let <Ci) be an arbitrary strict ordering induced 
by if . Furthermore, let if' be any extension of (lF(o,i>); L>x)) and M' bean 
interpretation corresponding to if' and which is a model of {0,T>). Suppose 
^fab' € Dj) is a default which is applied in if' \ E. Then ~^fab' G iies'([]) 
but fab G iies([]). Model M being preferred, we know that M' 7 ^ M. 
Therefore, we can also find some fab G Eab such that -•fab G iies([]) but 
fab G iies'([]). It follows that Sf^<^xi 5f^' (since <Cd is induced by if), that 
is, there exists a default which is preferred (wrt. <Ci)) to 5f^’ and which is 
applied in E\E' . 

a 5? . 

Let if be a prioritized extension of A(^q xi)- Then E is an extension of 
{W(^0,v)j Dv) and also, according to Lemma 3.6.1, a potential extension. 
Let M be an interpretation corresponding to E such that M is a model 
of (0,27). By contradiction, we prove that M is preferred. Suppose there 
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exists a preferred model M' of {0,T>) such that M' ^ M. This implies the 
existence of a corresponding prioritized extension E' of x>) according 
to the first part ( “ *^ ” ) of this proof. From M' -< M we conclude that there 
exists some Sj^i G Dx> which is applied in E' \E but no Sf^ G Dj) with 
higher priority and which is applied in E\E' . This contradicts E being 
prioritized extension. Qed. 

An immediate consequence of this one-to-one correspondence is that, as 
far as observations are concerned, the notion of skeptical entailment in pri- 
oritized default theories resulting from our axiomatization and the notion of 
entailment suggested by our action theory coincide. 

Corollary 3.6.2. Let {0,T>) he a qualification scenario with axiomatization 
^( 0 ,v)- An observation is entailed by {0,T>) iff the corresponding formula 
(c.f. (3.10) and (3.11), respectively) is skeptically entailed by Z\(o x>) • 

Proof. The claim is a consequence of Theorem 3.6.2 following the lines of the 
proof of Corollary 3.6.1. Qed. 



3.7 Bibliographic Remarks 

The Qualification Problem was introduced and so named in [76] as one of 
several arguments making manifest the urge for nonmonotonic representation 
and reasoning frameworks. The very article already anticipated the solution 
of introducing abnormality predicates which are to be (globally) assumed 
away by default. This solution was formally elaborated in [78] based on the 
by then existing nonmonotonic formalism of so-called circumscription [77]. It 
has first been observed in [64], however, that globally minimizing abnormal 
disqualifications of actions fails to suitably account for disqualifications that 
occur for reasons of causality. Rather than offering a solution, however. 

It is remarkable tbat tbe proposal put forth in tbe very article [78] to address 
tbe Frame Problem by globally minimizing change has been proved erroneous 
by a counter-example that shows some striking similarities to the refutation of 
global minimization as means to tackle the Qualification Problem. Suppose we 
consider abnormal any change of a fluent’s truth value during the execution 
of an action, as suggested in [78]. Then the Yale Shooting problem, which we 
already touched upon earlier in this book, arises as follows (c.f. [49]): Given 
that shooting at a turkey with a loaded gun causes the former to drop dead, 
we should expect exactly this to happen when we load the gun, wait for a short 
period, and then shoot. Yet globally minimizing abnormalities in this scenario 
produces a second model where the gun surprisingly becomes unloaded during 
the intermediary action of waiting and the turkey survives! While the magical 
change of the gun’s status is abnormal, the turkey surviving the shot is normal 
in the above sense — as opposed to the change of its life status in the intended 
model. Hence, this second model minimizes abnormality as well, though it 
is obviously counter-intuitive. The problem here is essentially that uniformly 
considering abnormal all changes is ill-dehned. A gun that becomes magically 
unloaded while waiting deserves being called abnormal but not the death of 
the turkey if being shot at with a loaded gun, which is perfectly normal from 




3.7 Bibliographic Remarks 113 



this paper marked the beginning of a shift away from the original concept 
of the Qualification Problem. Instead of being concerned with unlikely ac- 
tion preconditions to be assumed away by default, the Qualification Problem 
there has been taken as the task of extracting implicit action preconditions 
from general knowledge, e.g., state constraints — a task which we have already 
considered in the context of the Ramification Problem (see Section 2.8). The 
fundamental difference to the plain Qualification Problem is that any such 
implicit condition needs to be explicitly verified, hence cannot be assumed 
away prior to concluding that the action in question is executable. This rein- 
terpretation of the Qualification Problem has been taken up in subsequent 
work, e.g., [38, 68, 99]. There is no urge for arguing against the view that 
accounting for implicit action preconditions is part of some broader Qualifi- 
cation Problem. But denying that the core is to providing means to assume 
away, by default, the occurrence of a priori unlikely obstacles amounts to an 
oversimplification of the problem. 

One of the marginal consequences of dealing with the Qualification Prob- 
lem is that a property called “restricted monotonicity,” which has been 
claimed generally desirable for theories of actions in [67], is no longer so 
when facing qualification scenarios. A formalism possesses this property if 
additional observations can only increase the set of observations that are en- 
tailed by a domain description. This kind of monotonicity is obviously not 
appropriate in case of observations following by default. 

Exploiting solutions to the Ramification Problem when addressing the 
problem of caused vs. unmotivated action disqualifications has first been pro- 
posed in [110]. An earlier general alternative to global minimization, namely, 
chronological ignorance [102, 103], is in principle capable of treating correctly 
our key example with the potato being deliberately placed in the tail pipe 
(viz. Example 3.2.2), but the approach suffers from another, inherent limita- 
tion. Roughly speaking, the crucial idea underlying the principle of chrono- 
logical ignorance is to assume away, by default, abnormal circumstances but 
to prefer minimization of abnormalities at earlier timepoints.^^ Formally, a 
certain kind of modal logic is employed as a means to express the distinction 
between provable facts and propositions which might or might not be true. 
Our example domain, for instance, one would formulate in this framework by 
these two action descriptions: 

□ True (insert(a;), t) A □ True (-iin(x), t) A O True (-iheavy(a:), t) 

D □ True (in(x), t -I- 1) (3.15) 

the perspective of causality. The strong resemblance is apparent to the obser- 
vation that a potato being too heavy to lift is truly abnormal as opposed to 
the expectation that one will fail to start the car after attempting to place a 
potato in the tail pipe. 

This explains the naming: Potential abnormal disqualifications are ignored 
whenever possible, and this is done in chronological order. Assuming away 
obstacles whenever their occurrence cannot be proved, Shoham also calls the 
“ostrich” principle, or: “what-you-don’t-know-won’t-hurt-you.” 
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□ True (ignite, i) A □ True (-runs, t) A OVx. True (->in(x), i) 

D □ True (runs, i + 1) (3.16) 

where □ True (£, t) should be read as “at time t fluent literal £ provably 
holds” and O True {£, t) as “at time t fluent literal £ may or may not hold.” 
Thus the first of the two implications states that if it is known that some 
action insert(a;) occurs at time t, that ->in(x) holds, and that possibly 
-iheavy(x) holds at that time, then in(a;) holds at time t+1. Likewise, if 
it is known that the action ignite occurs at time t, that -runs holds, and 
that possibly Vx. -'in(a;) holds at that time, then runs holds at time t+1. 
Observe how regular action preconditions, like ->in(a;) and ->runs, must 
provably hold whereas abnormal disqualifying conditions, like heavy(x) and 
3a;. in(a:), are assumed away whenever the opposite does not provably hold. 
Now, suppose given □ True (-iin(pt), 1) A □ True (^runs, 1) in conjunction 
with the action occurrences □ True (insert(pt), 1) A □ True (ignite, 2) . 
Then chronological ignorance tells us that O True (->lieavy(pt), 1) holds 
since nothing is known about True (->heavy(pt), 1) itself. Hence, (3.15) 
implies □ True (in(pt), 2), which gives us -lOVx. True (->in(a;), 2) Thus 
the antecedent of the implication (3.16) is false (for t = 2) and, consequently, 
the second action, ignite, is correctly concluded unqualified. Notice that 
this being the unique suggested course of events relies on the chronologi- 
cal order in which minimization is performed. Otherwise, it could equally 
well be concluded that OVx. True (^in(a;), 2) holds, for, in the first place, 
nothing is known about Va;. True (^in(a;), 2) itself. This in turn entails 
-■OTrue (->heavy(pt), 1), i.e., □ True (heavy(pt), 1), since (3.15) is logi- 
cally equivalent to 

□ True (insert(a;), f) A □ True (-iin(x), t) A OTrue (-iin(a;), f + 1) 

+ □ True (heavy(a;) , t) 

This alternative conclusion corresponds to the counter-intuitive model ob- 
tained by global minimization of abnormalities (c.f. Section 3.2) but, as in- 
dicated, it is not supported by chronological ignorance. 

The interesting, albeit informal, reason for chronological ignorance com- 
ing to the desired conclusion in this and similar cases is a certain respect of 
causality hidden in this method. By minimizing chronologically, one tends to 
minimize causes rather than effects, which is the right thing to do, simply 
because causes generally precede their effects. On the other hand, the appli- 
cability of chronological minimization is known to be intrinsically restricted 
to domains and scenarios which do not involve indeterminate information. 
This has been shown for a variety of aspects of non-determinism; see, e.g., 
[60, 88, 104]. Informally speaking, the problem reveals whenever indetermi- 
nate information provides sufficient evidence for an abnormality without, by 
virtue of not being deterministic, necessitating it. Putting off abnormalities 
for as long as possible then ignores uncertain evidence and, in so doing, 

As usual in modal logic, -iD True (£, t) is equivalent to O True (->£, t) . 
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supports unsound conclusions. For illustration purpose, let us formalize a 
scenario mentioned by [88]. 

Example 3.7.1. Suppose we park our car overnight in a neighborhood that 
is known for its suffering from a tail pipe marauder. Then chances are good 
that the rascal has struck by the next morning so that we had better check 
for a potato in the tail pipe before trying to start the car. Let T> be the qual- 
ification domain of Example 3.2.2 but with action name insert replaced by 
stay-overnight®. The latter having a non-deterministic effect, it is specified 
by a pair of non-exclusive action laws, viz. 

stay-overnight transforms {} into {} 

stay-overnight transforms {-iin(pt)} into {in(pt)} 

Let O consist of the observation ^runsA-'in(pt) after [], then the qualifica- 
tion scenario {O.V) admits two categories of preferred models (i7, i?es), one 
of which satisfy -iin(pt) G iZes([stay-overnightj) while the others claim 
that in(pt) be true in i?es([stay-overnightj) . Hence nothing is entailed 
as to whether there is an abnormal disqualification of ignite or not after 
staying overnight. ■ 

Now to the problems of chronologically minimizing in this scenario. The fol- 
lowing formula specifies the new, non-deterministic action: 

□ True (stay-overnight, t) A □ True (->in(pt), t) , , 

D □ True (in(pt), t -I- 1) V □TRUE(-'in(pt),t-|- 1) 

That is to say, if at time t we stay overnight and there is no potato in the tail 
pipe, then at time t+1 this may or may not have changed. Consider this for- 
mula in conjunction with the specification of action ignite as given by im- 
plication (3.16), and suppose given □ True (->in(pt), 1) in conjunction with 
the events □ True (stay-overnight, 1) and □ True (ignite, 2) . Then im- 
plication (3.17) allows no definite conclusion about True (in(pt), 2), hence 
O True (-iin(pt), 2) following the principle of chronological ignorance, which 
in turn implies □ True (runs, 3) according to (3.16). Thus the conclusion 
that ignite be qualified at time 2 is sanctioned despite the possibility that 
the tail pipe marauder has struck by then. This undesired conclusion proves 
that the “what-you-don’t-know-won’t-hurt-you” principle is not suited for in- 
determinate information. While the Qualification Problem requires assuming 
away abnormal circumstances whenever they do not provably hold, this is in 
general too credulous if the performance of a non-deterministic action ren- 
ders quite possible such circumstances. A “what-you-can’t-expect-won’t-hurt- 
you” principle would be appropriate — clearly, an abnormal disqualification of 
ignite after carelessly leaving unattended the car in the dangerous neigh- 
borhood is to be expected from one of the possible effects of staying overnight, 
which is why this potential disqualification ought not be assumed away. 

Exploiting ramification as the key to the Qualification Problem shares 
with Motivated Action Theory (MAT, for short) [104, 2] the basic insight 
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that an appropriate notion of causality is necessary when assuming away ab- 
normalities. In the latter framework, event happenings are minimized while 
taking into account the possibility that their occurrence is being caused (or, 
in other words, motivated, hence the name). This excluding unmotivated 
events and our minimizing uncaused abnormal disqualifications are somehow 
complementary while being based on the same principles. However, some me- 
thodical criticism and limitations apply to MAT: An unsatisfactory property 
of the underlying preference criterion, i.e., motivation, is its depending on 
the syntactical structure of the formulas representing causal knowledge. As a 
consequence, logically equivalent formalizations may induce different prefer- 
ence criteria, of which only one is the desired. Moreover, the formal concept 
of motivation becomes rather complicated in case of disjunctive, i.e., inde- 
terminate information, which entails difficulties with assessing the range of 
applicability of MAT in general. Finally, events can only be ‘motivated’ by 
past events, which does not allow for telling apart caused events that occur 
concurrently with the triggering event. (In passing, we mention that this last 
restriction also applies to a version of the Event Calculus (see below) if events 
are allowed to trigger other events, as proposed in [98]). In comparison, in 
[II5] it is illustrated that ramification can also be successfully applied to the 
problem of minimizing event occurrences. 

Talking about the Event Calculus (introduced by [61]; for an overview 
and thorough analysis see [100] and the monograph [99]), dealing with the 
Qualification Problem there requires to respect a fundamental difference be- 
tween this approach and, say, both Situation Calculus and Fluent Calculus. 
Namely, the latter are based on a branching time structure where different, 
hypothetical action sequences may fork left and right of the actual time line, 
if any. In contrast, the Event Calculus grounds on a linear time structure 
thus representing solely the actual evolution of the world. The occurrence 
of actions is specified by assertions of the form Happens {A, T) stating that 
action A is performed at time T. A consequence of the linear time struc- 
ture is that once an assertion of this form is made, it cannot be withdrawn. 
This entails difficulties with formally representing observations of abnormal 
action disqualifications if these are interpreted as the action being physically 
impossible — which is the interpretation we pursued throughout our analysis 
of the Qualification Problem. An alternative view seems better suited for the 
Event Calculus or, for that matter, generally for action theories based on a 
linear time structure. Instead of tightly coupling an action with its success, 
such as starting the car with the effect that the engine is running, performing 
an action may first of all be taken as the mere intention to achieve a certain 
effect. The actual achievement then depends on both regular conditions be- 
ing met and the absence of abnormal, disqualifying circumstances. We do 
not wish to provide a detailed formal account of the Qualification Problem 
in the Event Calculus here, but let us mention that this is accomplishable by 
introducing a predicate Succeeds{a,t) meant to indicate that action a hap- 
pening at time t produces the intended effect. Abnormal conditions denying 
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the success of an action can then be dealt with along the lines of our action 
theory. 

Default Logic [85], the formalism in which our Fluent Calculus axiomati- 
zation of Chapter 2 has been embedded in view of the Qualification Problem, 
was among the earliest nonmonotonic frameworks, the key reference to many 
of which is the special volume [11]. Numerous modifications and extensions 
to classical Default Logic have subsequently been developed in order to cope 
with a number of ontological aspects missing or arguably being counter- 
intuitively dealt with in the original approach; a variant which uniformly 
addresses many of these aspects is presented in [19], just to mention one. 
However, the default rules occurring in our axiomatization bear a very spe- 
cific structure so that, fortunately, none of the criticisms of classical Default 
Logic applies (speaking in technical terms, our defaults are, without excep- 
tion, both “prerequisite-free” and “normal”, which means they all are of the 
form Prioritized Default Logic has been introduced in [15] to support 

the notion of preference among default rules. We have adopted the more ele- 
gant reformulation proposed by [87] , who also points out some problems with 
the original approach for certain classes of default theories — but again our 
particular default theories are not subject to this criticism due to their special 
structure. The approach [13] provides the first formal framework for priorities 
among unrestricted default rules. A variety of approaches to the automation 
of reasoning in default logics exist, e.g. [85, 7, 97, 96], just to mention a few. 
Those calculi which perform proof search in a local fashion, that is, which do 
not necessarily require the generation of entire extensions of an underlying 
default theory, are usually, however, restricted to so-called credulous reason- 
ing. In contrast to skeptical reasoning, the latter entails any formula that 
belongs to (at least) one extension. A general method for extending credu- 
lous proof procedures to skeptical entailment without loosing the property of 
being local and goal-oriented has been developed in the second part of [106]. 

For rounding off this chapter on the Qualification Problem some remarks 
seem appropriate on the topical debate about nonmonotonic vs. probabilistic 
logics. While the former amounts to qualitative reasoning about small like- 
lihoods, the latter is concerned with quantitative reasoning on the basis of 
exact knowledge of probability values. The strongest argument undermining 
the fundamental of all nonmonotonic frameworks is that sometimes counter- 
intuitive, if not contradictory, conclusions are unavoidable. This is best illus- 
trated with the well-known Lottery Paradox introduced in [62] (see also [84]): 

“In a fair lottery with 100 tickets the chance that any given ticket will 
lose is 99 per cent. It therefore seems reasonable, for any given ticket, to 
believe, or accept as a basis for action, the statement ‘This ticket will lose.’ 

Yet the conjunction of such statements for all tickets must be false, since 
some ticket will win, so we can hardly accept the conjunction. We seem 
to have to accept each statement separately, but not the conjunction of 
them.”i® 



[63], pp. 187-8 
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Scenarios like the foregoing clearly favor the use of probabilistic theory when 
it comes to decision-making based on the available information. On the other 
hand, it seems that only for a small fraction of daily-life situations such exact 
(conditional) probabilities can be provided. This is the strongest argument 
against probabilistic reasoning as an alternative to nonmonotonic logics. 
The relevance of this argument has revealed in the course of a research project 
devoted to formalizing large parts of human common sense knowledge: 

“Early on, we allowed each assertion in the knowledge base to have a nu- 
meric certainty factor, but this approach led to its own set of increasingly 
severe difficulties. [. . .] There wasn’t statistical data to support these num- 
bers [...]. These problems led us to go back to a simple nonnumeric scheme 
in which [assertions! [. . .] would simply be true (nonmonotonically, that is, 
true by default) [. . 

It SO seems that the pragmatic answer to the problem of which approach pro- 
vides the ‘right’ formalism, nonmonotonic or probabilistic logic, is to com- 
promise: Exact probability values should be used whenever both available 
and appropriate — as it is the case in the scenario described by the Lottery 
Paradox, for instance. When it comes to representing knowledge involving ab- 
normalities which are only vaguely known to be unlikely, choosing a nonmono- 
tonic framework helps to base decision-making on reasonable conclusions.^® 



Another argument often brought forward, namely, that probabilistic reasoning 
is computationally intractable, has been challenged by the notable practical 
success of Bayesian Belief Networks [81], which exploit knowledge of causal 
independence to speed up the reasoning process. 

[46], pp. 34-5 

To complete this brief discussion, successful attempts to embed nonmonotonic 
reasoning in logics of probability should be mentioned, e.g., [42]. 
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4.1 State Constraints with Exceptions 

Our account of the Qualification Problem was made possible by the foregoing 
solution to the Ramification Problem. The fact that abnormal circumstances 
can be brought about as side effect of performing certain actions is best 
accommodated, so our argument went, if these side effects are obtained just 
like all indirect effects are, that is, through ramification. In this concluding 
chapter, we will expand even further the connection between the Ramification 
and Qualification Problem. This time it is argued how solutions to the latter, 
and in particular the one we pursued, can in turn be successfully employed 
to address a generalization of the former. 

The motivation for this generalization of the Ramification Problem is 
inherited from the Qualification Problem. If actions in non-artificial envi- 
ronments may turn out unqualified for some abnormal reason, then so may 
ramifications, too. Recall, for instance, the introductory electric circuit from 
the beginning of Chapter 2, which connects a battery with a switch and a 
light bulb (c.f. Fig. 2.1). It has been said that the light is on if and only if 
the switch is closed. Formally, light = up(s]^). This state constraint, so the 
argument went, gives rise to indirect effects, e.g., the bulb is expected to light 
up once S;]^ is switched into the upper position. Taking this as eternal truth 
is appropriate for an idealistic version of the circuit. But the expectation that 
the light can be switched on in this way in a real circuit depends on addi- 
tional, tacitly presupposed conditions, e.g., the bulb must not be broken, the 
battery not be malfunctioning, and the wiring needs to be in order. If the 
belief is held that all of these conditions are usually satisfied, then we face 
a situation similar to the Qualification Problem. Namely, abnormal circum- 
stances, now denying the universality of a state constraint and, hence, the 
occurrence of the indirect effects it is supposed to trigger, need to be assumed 
away — but again only to a reasonable extent. 

The remainder of this chapter is devoted to this problem of accounting 
for exceptions to state constraints, as we call it. It will turn out that this end 
merely requires the adoption of the insights gained in the context of the Qual- 
ification Problem. This means that rather than newly extending the action 
theory we finally arrived at, it suffices to pursue a specific design strategy for 
state constraints that admit exceptions. Treatment of these exceptions, which 
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is identical to that of abnormal action disqualifications, is then determined 
by the model theoretic component of our action theory as it stands. 



4.2 Disqualified Causal Relationships 

Each state, including the initial one, resulting from the performance of action 
sequences is supposed to be acceptable, that is, to satisfy all state constraints. 
This implies that every constraint, as it stands, is universally valid, hence so 
is the occurrence of indirect effects it triggers. In order to account for possible 
exceptions to the strict truth of a state constraint, the latter is to be suitably 
weakened by restricting its range of applicability to normal circumstances. 
Formally, this is accomplished by rewriting a constraint C as ~'abc 3 C , 
where ahc is a new fluent which, if true, shall indicate that there happens 
to be an exception to C . Validity of the state constraint is thus confined to 
‘normal’, in a specific sense, states. 

Any restriction to the applicability of a state constraint should of course 
be transmitted to all corresponding causal relationships. If an exception to 
a constraint occurs, then the indirect effects it usually triggers are no longer 
expected. The following proposition shows that this comes for free if causal 
relationships are automatically extracted from the modified state constraints 
following the guidelines of Section 2.5. 

Proposition 4.2.1. Let £ and T he sets of entities and fluent names, re- 
spectively. Furthermore, let C he a variahle-free state constraint and I some 
influence information. If TZ and TV are the output of the generation proce- 
dure depicted in Fig. 2.6 on input (C\I) and {-'ahc 3 C\I), respectively, 
then 



e causes p if ^ G 7^ implies e causes p if ^ A -•ahc G 'hZ' 

Proof. Let C\ A . . . A C„ be the minimal CNF of C, then the minimal CNF 
of -•ahc D C is {C\ V ahc) A ... A {Cn V ahc). Let 

causes if ->£i 

I = 1, . . . , m 
I ^ 3,1 k 

be any member of TZ, then there exists some Ci = i\ \/ . . . y Im (where 
1 < i < n) such that (||t?j||, |Kfc||) G X and I < j ^ k < m. Accordingly, the 
disjunct V . . . V V ahc determines the causal relationship 

-•£j causes £k if f\ -•£i A -•ahc 

I = 1, . . . , m 

I 3,1 ^ ^ 



in TZ'. 



Qed. 
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^up(si) 



~.up(s2) 




-iwiring-problem 



Figure 4.1. In this electric circuit, a light bulb is involved in two otherwise in- 
dependent sub-circnits. The one on the right hand side inclndes a battery which 
is supposed to be so powerful — under normal circumstances — that it destroys the 
bulb as soon as switch S 2 gets closed. Normally, the two batteries, the wiring, and 
the light bulb are in order. 



It is straightforward to verify that this result is transferable to the gener- 
alizations of the algorithm of Fig. 2.6 developed in the section on influence 
information (Section 2.5). 

It is clear that the purpose of introducing a notion of abnormality into 
state constraints is to assume normal circumstances by default. In the con- 
text of the Qualification Problem it proved vital, to this end, to account for 
the fact that abnormalities may be caused by the performance of actions. 
It should not come as a surprise if something similar is observed in case of 
qualified ramifications, and indeed the following scenario shows how excep- 
tions to state constraints can wilfully be brought about. Global minimization 
of abnormalities is therefore inappropriate as regards state constraints and 
indirect effects, too. 

Example 4- 2.1. Suppose the light bulb is involved in a second, otherwise in- 
dependent sub-circuit consisting of its own battery and switch, see Fig. 4.1. 
Suppose further that the new battery, without an intermediate resistor, is too 
powerful for our light bulb so that the latter gets broken as soon as current 
flows through the rightmost sub-circuit. 

As a model of this circuit, let T> be the ramification domain consisting of 
entities and S 2 ; fluent names up^, light*^, broken^, wiring-problem 
malfunc;]^*^ , malfunc 2 ° , ab^, and o& 2 °; and action name toggle^ along 
with the action laws 

toggle(a;) transforms {up(x)} into {-'up(x)} 

toggle(a;) transforms {-'up(a:)} into {up(a;)} 



The set of state constraints shall divide into two non-steady, namely. 




122 4. Qualified Ramifications 



-•abi D [light = up(s]^) ] 

-■0&2 3 [^ip(s2) ^ broken] 

in conjunction with these three steady ones: 

broken V malfunc;]^ V wiring-problem D -ilight 
abi = broken V malfunc;!^ V wiring-problem 
0&2 = malfunc2 V wiring-problem 

Let the underlying influence information be 

I = { (up(sj^), light), (broken, light), (malfunc;]^, light), 

(wiring-problem , light), (up(s2), broken), (malfunc2, broken), 
(wiring-problem , broken), (broken, abi), (malfunc;]^, o&i), 
(wiring-problem , o&i), (malfunc2, 062)5 (wiring-problem , 062) } 

This determines the following causal relationships, all of which except for the 

first three are steady: 
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Consider, now, a situation where we only know that both switches are open 
(as depicted in Fig. 4.1). What would be the predicted outcome of first tog- 
gling S2, followed by Since nothing hints at a malfunctioning battery or 
bad wiring, we expect that up(s2) D broken be true and, hence, broken is 
an indirect effect of closing S2. Consequently, closing S;]^ afterwards should 
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fail to produce light. But globally minimizing abnormality in this scenario 
does not yield this conclusion. Obviously, some abnormality is inevitable. 
One minimal model is therefore given by -'0&2 with regard to the first ac- 
tion, and ah I with regard to the second. This reflects the expected course of 
events. Yet we can just as well assume the first ramification unqualified (i.e., 
o&2)j which then would avoid the necessity of assuming a disqualification of 
the following ramification (i.e., -lO&i). For if the bulb does not break as a 
consequence of toggling S2, then light turns on as the usual indirect effect 
of toggling afterwards. This gives us a second, unintended model, where 
the more powerful battery is supposed down and light is on in the end. ■ 



4.3 Causing Exceptions to State Constraints 

Global minimization being inappropriate when assuming away exceptions to 
state constraints, too, our solution to the Qualification Problem furnishes a 
ready approach to satisfactorily tackling abnormalities in the new context. 
Since exceptional circumstances can be brought about as side effect of per- 
forming actions, fluents representing these circumstances should be made 
subject to both ramification and, otherwise, persistence. Being a priori un- 
likely to occur, these fluents are to be minimized initially to the largest rea- 
sonable extent, just like the fluents that describe abnormal disqualifications 
of actions. 

To summarize, the amalgamation of the Ramification and Qualification 
Problem is addressed as follows. Each state constraint C which admits ex- 
ceptions is replaced by the weaker fluent formula ahc C, with ahc being 
a distinct new fluent name. The reading of the modified constraint is that 
now C holds only under normal circumstances. As for the case of action 
disqualifications, the fluent ahc should be engaged in additional state con- 
straints defining conditions for an exception to C. In order to reflect that 
ahc indicates an exceptional situation, this fluent is subject to minimization, 
i.e., belongs to the set of abnormality fluents of the domain at hand. 

Example 4 - 3 - 1 - Let T> be the qualification domain of Example 4 . 2 . 1 , and let 
Tab = {malfunc;!^, malfunc2, broken, wiring-problem, o&i, 0&2} 

Let E be the transition model of V, and suppose O consists of the obser- 
vation 

->up(s]^) A -'up(s2) after [] 

Since no abnormality needs to be granted, the qualification scenario { 0 ,T>) 
admits a unique preferred model {E,Res) where ->up(s j^), -iup(s2) G Res([]) 
and Res([]) C\Tab = {}• Performing toggle(s2) in Res([]) has the direct 
effect up(s2) and the indirect effects broken and, hence, ab\ according to 
the following causal relationships. 
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up(s 2 ) causes broken if -'a &2 
broken causes ah\ if T 

From a&i G i?es([toggle(s 2 )]) it follows that toggling in that state fails 
to produce the otherwise expected indirect effect light. Hence, the only 
acceptable state which can be assigned to i?es([toggle(s 2 ), toggle(s j^)]) is 

{up(si),up(s2), -■light, 

broken, -■wiring-problem , -■malfunc;!^, -■malfunc 2 , a&i, ~'ob 2 } 
Consequently, {0,V) entails the observation 

flight after [toggle(s 2 ), toggle(s;l^)] 



as expected. 



4.4 Bibliographic Remarks 

By the time this book was written the problem of ramifications having excep- 
tions had received little attention in literature, presumably because satisfac- 
tory solutions to the Ramification Problem itself had not emerged until very 
recently. The proposal to combine solutions to both the Ramification and 
Qualification Problem has been made in [112, 114]. About the only other pa- 
pers on that topic are [5, 123]. In both of them expressions resembling causal 
relationships are allowed to be defeasible. The former presents a formal action 
theory, whose semantics is, however, indirectly defined via a translation into 
extended logic programs and by appealing to the notion of answer sets [32]. 
The latter does not go beyond defining a notion of successor state based on 
minimizing abnormality. In order to successfully cope with qualified ramifi- 
cations, therefore, the general approach needs to be adopted of minimizing 
abnormality in the initial state so that persistence and ramification take care 
of the further evolution. 
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